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FORETORD 



The increasing contribution of mathematics to the culture of the modem 
world, as well as its importance as a vital part of scientific and humanistic 
education, has made it essential that the mathematics in our schools be both 
well selected and well taught. 

With this in mind, the various mathematical organizations in the United 
States cooperated in the formation of the School Mathematics Study Group (SMSG) . 
SMSG includes college and university mathematicians, teachers of mathematics at 
all levels, experts in education, and representatives of science and technology. 
The general objective of SMSG is the improvement of the teaching of mathematics 
in the schools of this country. The National Science Foundation has /provided 
substantial funds for the support of this endeavor* 

One of the prerequisites for the improvement of the teaching of mathematics 
in our schools is an improved curriculum — one which takes account of the increas- 
ing use of mathematics in science and technology and in other areas of knowledge 
and at the same time one which reflects recent advances in mathematics itself. 
One of the first projects undertaken by SMSG was to enlist a group of outstanding 
mathematicians and mathematics teachers to prepare a series of textbooks which 
would illustrate such an improved curriculum. 

The professional mathematicians in SMSG believe that the mathematics pre- 
sented in this text is valuable for all well-educated citizens in our society 
to know and that it is important for the precollege student to learn in prepara- 
tion for advanced work in the field. At the same time, teachers in SMSG believe 
that it is presented in such a form that it can be readily grasped by students. 

In most instances the material will have a familiar note, but the presenta- 
tion and the point of view will be different. Some material will be entirely 
new to the traditional curriculum. This is as it should be, for mathematics is 
a living and an ever-growing subject, and not a dead and frozen product of an- 
tiquity. This healthy fusion of the old and the new should lead students to a 
betcer understanding of the basic concepts and structure of mathematics and 
provide a firmer foundation for understanding and use of mathematics in a 
scientific society. 

It is not intended that this book be regarded as the only definitive way 
of presenting good mathematics to students at this level. Instead, it should 
be thought of as a sample of the kind of improved curriculum that we need and 
as a source of suggestions for the authors of commercial textbooks* It is 
sincerely hoped that these texts will lead the way toward inspiring a more 
meaningful teaching of Mathematics, the Queen and Servant of the Sciences. 
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PREFACE 



The present volume is an experimental edition for a high-school course in 
the theory of matrices and vectors. In selecting material for the text, the 
School Mathematics Study Group has been mindful of the fact that this is the 
last mathematics course in secondary school, the terminal course lor many 
students. As citizens, they should have a sound idea of the nature of mathe- 
matics. This point of view has been emphasized in the Harvard report, "General 
Education in a Free Society," Harvard University Press, Cambridge, 1945, which 
states: "Mathematics may be defined as the science of abstract form. The dis- 
cernment of structure is essential, no less to the appreciation of a painting or 
symphony than in the behaviour of a physical system; no less in economics than 
in astronomy. Mathematics studies o^'der, abstracted from the particular objects 
and phenomena which exhibit it, and in a generalized form," 

One of our basic aims is thus to demonstrate the structure of mathematics. 
We shall not be concerned, however, with structure merely as such. Rather, we 
shall exhibit some rich mathematics that is totally new to the student and 
demonstrate structure as we proceed. To make abstract form a topic unto itself 
often leads to a barren presentation; to discuss the structure of the already- 
familiar arithmetic and algebra seems forced and repetitive to the boy or girl 
who is dreaming of a place in a jet age,'^ even in a space age. 

It is important to give the student some "new" mathematics that has con- 
siderable vigor and vitality. Until very recently, the high-school curriculum 
has been almost entire] y concerned with ideas that were developed during or 
before the sixteenth and seventeenth centuries. Computers and electronic brains 
are front-page news. In order to appeal to the imagination of the student and 
to expose some mathematics that is very much alive, the material must be new, 
different, and bold. 

Another criterion is to provide some tools that will be eminently useful 
in the student's transition from school to college, tools that will help bridge 
the gap from the manipulative spirit of high-school mathematics to the abstract 
viewpoint of modern algebraic studies. Yet this material must not come from 
the usual sequential courses. 

' A unit on matrix algebra will satisfy the foregoing criteria. As one 
operation after another is defined, the structure of mathematics can be repeat- 
edly emphasized. Terms like group, ring, field, and isomorphism will be intro- 
duced when ;neaningful and needed for unifying concepts. Thus they will be met 
in a new, appropriate, and substantial context; they will not be applied to 
shopworn material. Introduced by Cay ley in 1858, recognized by Heisenberg in 
1925 as exactly the tool he needed to develop his revolutionary work in quantum 
mechanics, employed today in such diverse ways as providing a language for 
atomic physics, measuring the air flow over the wing of an airplane, and keeping 
the parts inventory at a minimum in a factory, matrices can put thq student 
close to the frontiers of mathematics and provide s tr iking' examples of patterns 
that arise in^the most varied circumstances. Moreover, the student meets some 
mathematics emancipated from the familiar rules of arithmetic, and he learns 
"that it. is within his capacities to "invent" some of his own. If this study 
can make mathematics more alive, then here indeed is a promising path. 



10 



Our study of matrix algebra will involve the investigation of a significant 
postulational system, v?hich will reflect the vigor of abstract mathematics. This 
is a unit in "hard" mathematics that has power and beauty. It will provide an 
effective language and some dynamic concepts that will enhance the student's 
ability to handle his first college courses yet not duplicate material. 

Lastly, with the objective that the intellectually vigorous students may, 
in some small part, obtain an idea of what constitutes "mathematical research," 
there is appended a set of "Resuarch Exercises." These are by no means over- 
night homework and any one of them may well constitute a project to be executed 
by several students* Such team operations are conducive to stimulating dis- 
course and critical thinking. 



Chapter 1 
MATRIX OPERATIONS 



I— I . In troduc tion 

As we have studied more and more sophisticated mathematics ^ v?e have had 
occasion to use more and more sophisticated kinds of "numbers." We began with 

the set of counting numbers, I, 2, 3, Then, in order to make subtractions 

like 3-7 possible, the system was extended to the entire set of integers, 
0, + 1, 4- 2, + 3,.... Next, in order to make it possible to divide any number 
by any nonzero number, rational numbers like 1/2, -2/3, -157/321, and 4/2 were 
invented. This did not bring us to the end of our story, for, in order that 
every positive number should have a square root, a cube root, a logarithm, etc., 
it was necessary to invent still more niambers: the infinite decimals or real 
numbers , such as 1.4142..., 3.1415928;.., and 0 . 13131313 ... . Finally, in order 
that negative numbers should also have square roots, and that such quadratic 
equations as 

x^ + X + 1 = 0 

should have solutions, it was necessary to invent complex numbers like 
3 + 2i, 1 + ici, -1/2 + (l/37)i, and 3 + Oi . 

Whenever there has seemed to be a good reason to do so, we have invented 
new sets of "numbers." For instance, in inventing complex quantities, we 
began not with the quantities themselves but with a purpose: to find a 
system of numbers each of which has a square root. \^en we have made one 
such invention, it is not hard to realize that there is no reason to stop 
inventing. Why should we not hope to invent many kinds of new numbers? 

It is easy to invent things that do not work, but hard to invent things 
that do work — easy to invent things that are useless, but hard to invent 
things that..are useful. The same is true of the invention of new kinds of 
numbers. The hard thing is to invent useful kinds of numbers, and kinds of 
numbers "that work." Nevertheless, several more or less successful new kinds 
of numbers have been invented by mathematicians. In this book, we are going 
to study one of the most successful of these new kinds of numbers: the matrices 

Before we tell you what matrices are, it is well for us to emphasize their 
importance. They are useful in almost every branch of science and engineering. 
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A great number of the operations performed by the giant "electronic brains" 
are computations with matrices. Many problems in statistics are expressed in 
terms of matrices, Matrices come up in the mathematical problems of economics. 
They are extremely important in the study of atomic physics; indeed, atomic 
physicists express almost all their problems in..terms of matrices, and it would 
not be an exaggeration to say thr '^ebra of matrices is the language of 

atomic physics. Many ot^' ebra, such as compleK-number algebrfi 

and vector algebra, whi< y y already have studied, can be ; Uined 

very easily in terms ot wa 'u , 60, in studying matrices, you will be acudy- 
ing one of the newest and most important, as well as one of the most interesting, 
branches of mathematics. 

Let us look at a few simple examples. 

Many a baseball fan, when he first opens the newspaper, refers to a tabula- 
tion similar to the following: 





6 


AB 


R 


H 


Aaron 


68 


280 


52 


109 


Williams 


52 


194 


29 


60 


Mantle 


60 


228 


51 


70 


Lopez 


63 


241 


38 


72 



If he is a Mantle fan, he looks at the entry in the third row and fourth column 
of numbers in order to learn how many hits Mantle has thus far obtained during 
the season «^ 

You will note that we have said *»row" in speaking of a horizontal array, 
and "column" in speaking of a vertical array. Thus, the third row is 



60 228 51 70, 

and the fourth column is 

109 
60 
70 

72 . 



An assembler of TV sets might have before him a table of the following 
sort; J 3 



3 



Model A Model B Model C 

Number of tubes 13 18 20 

Number of speakers 2 3 4 

This table indicates the number of tubes and the number of speakers used in 
assembling a set of each model* 

Omitting the row and column headings, let us focus our attention on the 
arrays of numbers in the last two * 4* jumpier 

68 280 52 109 

52 194 29 60 13 18 20 

228 51 70 2 3 4 

63 241 38 -72 

Such arrays of entries are called matrices (singular: matrix). Thus a 
matrix is a rectangular array of entries appearing in rows and columns. 
Actually, the entries may be complex numbers, functions, and in appropriate 
circumstances even matrices themselves; however, with a few exceptions that 
will be clearly indicated, we shall confine our attention to the real numbers 
with which we are already familiar. 

Some examples of matrices are the follpwing: 



[2 3 4l r 0 v^] 
[l 0 -IJ ' [^.14 ij ' 



4 
2 
0 
-2 



[l/2 1/4 1/8] . (1) 



You will note here how square brackets [ ] are used in the mathematical 
designation of matrices. 

A great advantage of this notation is the fact that we can use it in 
handling large sets of numbers as single entities, thus simplifying the statement 
of complicated relationships • 



1-2 > The Order of a Matrix 

The order of a matrix is given by stating first the number of rows and 
then the number of columns in the matrix. Thus the orders of the matrices in 

[sao* 1-1 J 



if 

the foregoing examples (1) are respectively 2x3 (read "2 by 3"), 2x2, 
4x1, and 1x3. Generally, a matrix that has m rows and n columns 
is called an m x n (read "m by matrix, or a matrix of order m x n. 

If the number of rows is the same as the number of columns, as in the 
second example above, then the matrix is square . Thus, given two linear 
equations in two unknowns. 



2x + 3y = 7, 
Ix - 2y » 0, 



we observ aat 



efficients of x and y constitute a square matrix; 



2 3 
1 -2 



When speaking of a square n x n matrix, we often refer to its order as n 
rather than n x n. For example, the 2x2 matrix 



2 1 

3 4 



is a square matrix of order 2, and the 3x3 matrix 



-12 3 
4-5 6 
7 8-9 



is a square matrix of order 3. 

If the number of rows is 1, as in the fourth example in (1), above, thu 
matrix is sometimes called a row matrix or a row vector . For example, in terms 
of rectangular coordinates, a point in a plane might be designated by the row 
matrix ^2 3j , or a point in space by the row matrix ^2 3 — ij . 

Similarly, a column matrix or column vector is a matrix having just one 
column. Thus, the foregoing points can equally well be designated by column 
matrices, 



or 
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2 
3 
-1 



[seo* 1-2] 
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and the number of men, women, and children In a family might be denoted by 



Capital letters are often used to denote general matrices, and the 
corresponding small letters with appropriate subscripts are then employed 



to designate entries* 


Thus 


, we 


might have 












'«ii 


«12 


«13 




B - 


'^1 ^2 






A - 


«21 


^22 


^23 




.''21 ''22 


''23 






«31 


^2 


«33 













In these examples, the entries Located at the Intersection of the 2nd row and 
3rd column are denoted by a^^ and b^^, respectively. 

Generally, the entry located at the . Intersection of the 1-th row and 
j-th column of matrix A Is denoted by a^j . An m X n matrix can be denoted 
compactly as j^a^^j__. Thus the foregoing matrices A and B 



are 



mx n 



A - [^j] 3X3 ^""^ Kj] 2X3- 

If the order is cle«r from the context or Is h Itrary, the notation might be 
rcBuced to 



hi] 



and 



Associated with each matrix Is another matrix called its transpose , which 
is often convenient to use and has Interesting theoretical properties. The 
transpose A^ of a matrix A Is formed by interchanging its rows and columns. 
For example, if 



A - 



122] 
3-1 Oj * 



then A 



1 3 

2 -1 
2 0 



Definition 1-1 * If A - [a^jj 
&} of A is the n X m matrix B - 



Is an B3 X n matrix, then the transpose 
with b^j " *jl 
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if j (i - 1,2 n; j « 1,2 m) . 



Exercises 1—2 



1. (a) Obtain from a newspaper or other similar source six examplos of 

information presented in matrix form. 

(b) In each of your examples, state the order of the matrix, 

(c) In each of the examples, suggest an alternative method (not in matrix 
form) of prei?enting the same information. 

2. A row vector vith three entries can be used to tabulate a person's age, 
height, ..;.d weight. 

(a) Give a row vector that lists your age, height, and weight. 

(b) Suggest when it might be useful to employ such a vector. 



Let 



A » 



1 2 3 4 5 

8 10 12 14 16 

"1 "3-5 6 3 

0 3 -7 8 7 



f fli) fesLt is the order of A? 

(b Nane the entries in the 4th row, 

Cc> P^toe the entries in the 3rd column. 

(d) HtatR the entry a^^. 

<a) fiimt the entry a^^^. 

(f) K^ene the entry a^j^. 

{gj Write the transpose A^. 



4. 



10)0 

0 1 LJ 0 

0 0 10 

0 0 0 1 



t^t^) i^4J6.t is the order of B7 
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[sec» 1-2] 



(b) Name the entries in the 3rd row. 

(c) Name the entries in the 3rd column. 

(d) Name the entry h^^' 

(e) For what values i, j is b^j 9* 0? 

(f) For what values i, j is b^^ « 0? 

(g) Write the transpose B^. 

5. (a) Write a 3x3 matrix all of whose entries are whole numbers. 

(b) Write a 3x4 matrix none of wHose entries are whole numbers. 

(c) Write a 5x5 matrix having all entries in its first two rows 
positive, and all entries in 'Xs last three rows negative. 

6. (a) How many entries are there in a 2x2 matrix? 

(b) In a 4x3 matrix? 

(c) In an n x n matrix? 

(d) In an m X n matrix? 



1—3. Equality of Matrices 

Two matrices are equal provided they are of the same order and each entry 
in the first is equal to the corresponding entry in the second. For example, 



14 0 
2 8 4 



1 2x22-2 
4/2 16/2 8/2 



2 




■2^" 




- 
















4 


83 


2^ 
















X 




8 




2^ 









(x-l)(x+l) 

X 



but 



12 3 

4 5 6 



1 4 

2 5 

3 6 



Definition 1-2 . Two matrices A and B are equal , A » B, if and only 
if they are of the same order and their corresponding entries are equal. 
Thus, 18 



[sec. 1-2] 



8 



mx n 



If and only if a. ^ » b^^ for each l,j(l-l,2 m; j«i,2 n) . 

Using the foregoing definition of equality, we can express certain 
relationships more compactly. For example, the following equation between 
2x1 matrices. 



can be employed instead of the tv?o separate equations 



2x + 3y « 7, 
3x - y - 2; 



and 



Fx + y a + b] ^ 
I X — y a — b 

can be written in place of the four equations 



x+ya5, a+b»-l, 



x-y»l, a-b 



3. 



Exercises 1—3 



1. Solve the following equations: 



(a) 


'x + 2 ■ 




.3- y 


(b) 


X — 2y 




X + y 


(c) 


■ 2 

X y 




2 




y . 



[-3] 



1 
-1 
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2. From the matrix equalities A B and B « C, would you conclude that 
A » C? Why? 

3. Write the matrix 



^11 ®12 ®13 
®21 ®22 ®23 



if 



a^j « 2i + 3j - 4, 

4. Write the matrix whose entries are the sums of the corresponding entries 
of the matrices 



1 0 

2 -1 
-3 4 
0 1 



and 



0 2 

-3 4 

2 1 

0 0 



5. Write the matrix whose entries are the differences (first minus second) of 
the corresponding entries of the matrices in Exercise 4« 



1-^. Addition of Matrices 

We have now defined matrices and studied some of their most elementary 
properties. But we have not really made them work. To do this, we must give 
rules for adding and::mul tip lying matrices, just as was done, for example, with 
complex numbers. If these numbers were defined bluntly as expressions of the 
form a + bi, without the operations of addition and multiplication, and with- 
out relation to the solution of such equations as 

x^ + X + 1 « 0, 

they would be of relatively little interest. What gives life to complex 
numbers is the fact that we are able to define addition and multiplication for 
them in such a way that we have a whole algebra of complex numbers, which is 
tutted useful and interesting. 20 
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The same remark applies to matrices. To give the study of matrices its 
real content, we must define "sum" and "product" for matrices. In this section, 
we define and study sums of matrices. Products will be considered later. 

You will recall th&t when two complex numbers are added, for example 
3 + 5i and -2 + 4i, the two real components and the two imaginary crn ^nents 
are added separately. Thus, 

(3 + 5i) + (--2 + 41) = (3 + (-2)) + (5 + 4)i = 1 + 9i. 

If we represent the complex numbers as column vectors, we find their sum 
by adding corresponding entries; thus. 



—2 
4 



This suggests the pattern used in adding matrices of the same order. The 
sum of two such matrices is obtained by adding the individual entries in cor-- 
responding positions. For example. 



2 3 1 
"10 4 



2 

1 3 



-2 5 2 
0 3 2 



Since we shall not even give a rule by which matrices of different orders 
could be added, we shall add two matrices only if they are of the same order. 
Accordingly, two matrices that have the same order are sometimes said to be 
conformable for addition . The sum has the same order as the two addends. 

Deflrition 1-3 . The sirm A + B of two ra x n matrices A and B is 
the m X n matrix C such that the entry c . in the i— th row and j— th 



column of C is equal to the sum a^^ + b^^ of the entries a^^ 
in the i— th row and j— th column of A and B, respectively. 
Thus, 



and b 



ij 



hj]mxn-^ [^j]mxn^ hj-^\j]mxn' 
For instance, 2 1 
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If we consider all m X n taatrlccs, with m and n fixed, as constituting 

a set S , and if A and B are elements of ^, then A + B is also 

an elemellt^of this set. That is, if A € S^^^ (read "A is an element of 

S and B e S , then (A + B) € S . 

m,n ni,n 

In the algebra of real numbers R, the equation 



a + 0 



Is satisfied for all a € R (this time, read "for all a e r" as "for all • 
elements a of R") . Accordingly, we' say that 0 is the identity element for 
addition in R. In the algebra of matrices, the matrices all of whose entries 
are 0 play a corresponding role. Thus, 



[4 i\ ^ [s I] 



2 
-1 



+ ol 
+ oj 



2 3 
-1 4 



Such a matrix is called a zero matrix and is denoted by 0. If the order 



m X n is significant we write 
write 0_ 



mx n' 



or, if the matrix is square, we might 



where n indicates the order of the matrix. Thus, 

[o o] , 



1X2 



2X3 



0 0 
0 0 



°3- 



0 0 
0 0 
0 0 



The equation 



A^ +0 « A 
^X n mxn 



mx n 



clearly is valid fsar all Aj^^n' 

The addittcm of matrices is a cosmutative operation, as we can readily 

verify. Thus, 



«11 ^12 ^3 
^21 ^22 ^23 



^1 ^2 ^13 
^21 ^2 ^23 



^11 ^2 ^13 
^21 ^22 



^11 ^12 ^13 
^21 ^22 ^23 
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In particular, the suit, of the two matrices on the left is a matrix having 

^12 ^12 element in the first row and second column, and the corresponding 

element of the sum on the right is b^^^ + a^^. But 



^2 + hz' 



by the commutative law for the addition of real numbers. 

The foregoing observation holds generally, of course, so that we have the 
following result: 

Theorem 1-1 . If the matrices A and B are conformable for addition, 
then they satisfy the commutative law for addition: 

A + B « B + A. 

Proof. We have 



A + B = 



B + A. 



Thus, in terms of our usual notation, the entry in the i-th row and 

j-th column of the sum on the left is a^^ ' > corresponding 

entry of the sum on the right is b. , + a 

ij ij 



But 



a. . + b. . = b. . + a. . 

ij iJ ij ij 



by the commutative law for the addition of real numbers; hence the theorem 
follows from the definition (Definition 1-2) of the equality of two matrices, 
The addition of conformable matrices is also associative; that is. 



A + (B + C) « (A + B) + 

23 
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[4 0 e] [3 I 3] " [-1 I 9] ' 



and also 



1-^ 0 ^1 

1-15 1] 
[-6 0 7j 



-12 0 
-2 0 



\ It] '[-III] 



We can state the associative property as a theorem and prove it, as 
follows: 

Theorem 1-2 > If the matrices A, B, and C are conformable for addition, 
then they satisfy the associative law for addition: 

A + (B + C) « (A + B) + C. 

Proof > We note that, in terms of our usual notation, the entry in the 
i-th row and j-th column of the sum on the left is a^j + (b^j + c^j), and 
the corresponding entry of the sum on the right is (a^j ^ij^ ^i j * 

You can complete the proof of Theorem 1-2 by telling why this last equality Is 
valid for all real numbers a^j, b^j, and c^j, and why this equality implies 
the matrix equality 

A + (B + C) <• (A + B) + C. 

Since it is immaterial in which order the matrices are added, we write 
A + B + C for either expression: 

A + (B + C) = (A + B) + C " A + B + C. 
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Once we know how to add numbers, it is usual to consider subtraction. You 
will recall that the negative, which we might call the additive inverse , of the 
real number a is denoted by —a. It satisfies the equation 

a + (-a) = 0. 

Subtraction of matrices arises in a similar manner. 

Definition 1—4 . Let A be an m X n matrix. Then the negative of A, 
written -A, is the m x n matrix each of whose entries is the negative of 
the corresponding entry of A. 

Definition 1-5 . If A and B are two m X n matrices, then the 
difference of A and B, designated by A — B, is the sum of A and the 
" negative of B. 

Thus, for A + (— B) , where A and B are matrices of equal orders, we 
write A — B and say that the symbols indicate that B is to be subtracted 
from A. For example. 



2 


1 


3' 




^0 


1 


-1 




'2 


0 


5" 




4, 


, 0 


-1 




3 


4 


1 




1 


-4 


-2 





and 



1 t-c' 




'1 t 




"0 


— c 


t+c 4 




c 2 




t 


2 



Now we can easily prove the following thaorem: 

Theorem 1—3 . If A and B are m X n matrices, and 0 the m X n 
zero matrix, then 

(a) A + (-A) =0, 

(b) - (-A) = A, 

(c) - 0 = 0, 

(d) _ (A + B) = (-^) + (-B). 

25 
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Proof of Theorem 1-3 (a) > The entry in the i-th row and j-th column of 
-A is, by definition, j • ^^"^ entry in the i-th row and j-th column 

;r6£ A + (-A) is a^j + ^*"^ij^* ^ij ^""^ij^ ° ^* nence every entry 

of A + (-A) is zero; that is, A + (-A) is the zero matrix. 

The proofs of the remaining parts are similar and are left to the student 
as exercises • 



Exercises 1—4 



I. Find values x, y, a, and b that satisfy the matrix relationship 



X + 3 2y - 8 
a + 1 4x + 6 
b - 3 3b 



0 - 6 
-3 2x 
2b + 4 -21 



If 



3 2 1 
4-5 6 
0 8-3 

4 6 8 



and B " 



-3 


4 


8 


-2 


.6 


-1 


0 


2 


3 


4 


-1 


8 



determine the entry in the sum A + B that is at the intersection of 

(a) the 3rd row and 2nd column, 

(b) the 1st row and 3rd column, 

(c) the 4th row and 1st column. 
3. Compute 



1/2 1/3 
1/4 1/5 



1/6 1/7 
1/8 1/9 



Compu te 



1/2 


1/3 


1/4 




1 


0 


0 


1/5 


1/6 


1/7 


+ 


0 


1 


0 


1/8 


1/9 


1/10 




0 


0 


1 



5. Gompute 



26 
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X 


y 


z 




1-x 


-y 




p 


8 


t 


+ 


-? 






u 


V 


w 








1-w 



6. (a) Does the expreaalon 



3 2 1 
13 2 
3 12 



+ 0. 



make sensa? 

(b) Does the expression 



3 2 1 
13 2 
3 12 



make sense? 

(c) What is the latter sum? 
7 • Compute 



110 
10 1 
>/5 1 0 



3 2 1 

4 17 8 
9 6 14 



0 13 
14 8 6 
1+v^ 11 11 



8. Compute 



12 3 
4 5 6 
7 8 9 



9 8 7 
6 5 4 
3 2 1 



0 0 0 
0 0 0 
0 0 1 



10 10 10 
10 10 10 
10 10 10 



9 • Given 



1 2 
3 4 
5 6 



compute Che following: 



2 -1 

3 -2 
0 1 
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and C ■ 



4 2 
1 0 
-2 -4 
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(a) 
(b) 
(c) 



A + B, 

(A + B) + C, 
A + (B + C), 



(d) A - B, 

(e) (A - B) -I- C. 

(f) B-A. 



10. (a) In Exercise 9, consider the answers to parts (b) and (c) . What law 
Is Illustrated? 

(b) In Exercise 9, consider the answers to parts (d) and (f ) . What con- 
clusion can be drawn? 

11. Prove Theorem 1-3 (b) . 

12. Prove Theorem 1-3 (c) . 

13. Prove Theorem 1-3 (d) . 

14. Assuming that A and B are conformable for addition, prove that 
A^ -f B^ » (A + B)^. 



1-5. Addition of Matrices (Concluded) 

The theorems given In Section 1-4 Include exact analogues of all the basic 
laws of ordinary algebra, Insofar as these laws refer to addition and subtraction. 
We know that all of the more complicated algebraic laws concerning addition 
and subtraction are consequences of these basic laws. Since the basic laws of 
the addition and subtraction of matrices are the same as the basic laws of the 
addition and subtraction of ordinary algebra, all the other laws for the 
addition and subtraction of matrices must be the same as the corresponding laws 
for the addition and subtraction of numbers. We can state this as follows: 

Insofar as only addition and subtraction are involved, the algebra of 
matrices Is exactly like the ordinary algebra of numbers . 

So you do not have to study the algebra of addition and subtraction of 
matrices — you already know Itl But now the algebra that you already know 
has a new and much richer content. Formerly, It could be applied only to 
nianbers. Kow, It can be applied to matrices of any order. Thus, we have 
obtained a very considerable result with a very small effort, simply by 
observing that our old algebraic laws of addition and subtraction apply not 
only to numbers, but also to quite different kinds of things, namely, matrices. 
This very powerful trick of putting old results In new settings has been used 
many times, and often with great success. In the most modem mathematics. 
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A good example of the general principle emphasized above is provided by 
the following problem. Suppose that A and B are known matrices of the 
same order. How can we solve the equation 

X + A « B 

for the unknown matrix X? The answer is easy. We do exactly what we learned 
to do with numbers. Add the matrix —A to both sides. This gives 

X + A + (-A) » B + (-A) « B - A. 

Since A + (-A) « 0, and X + 0 « X, we have 

X « B - A. 

This is our solution. 



Exercises 1^5 



1. Solve the equation 



for the matrix X. 
2 • Solve the equation 



X + 



0 0 1 
0 10 
10 0 



2 12 

3 2 3 

4 3 4 



for the matrix X. 
3. If [x^ X3] - [-6 0 2] 



- [-6 2 -3] , 



determine 



4. If 



determine 



29 
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If 



2 -3 
4 0 



L^l ^2, 



-3 4 
5 -I 



^rt 2 that if the matr; / r A, B, and C axe coirformable for addition, 
I (A + C) - (A + - C B. 

the equation 



10 0 
0 2 3 
10 4 



10 0 
0 2 3 
10 4 



- f 



2 2 
1 2 



valid? 



1-6. Multiplication of a Matrix by a Number 

Once we know how to add numbers, It Is customary to define* 2x as the 
sum X + X, 3x as the sum 2x + x, etc. Fractional parts of x are defined ' 
by requiring that (l/2)x + (l/2)x « x, (l/3)x + (l/3)x + (l/3)x « x, etc. 
All of this can readily be done with matrices. If we add two equal matrices, 
the sum is clearly a matrix in which each entry is exactly twice the correspond- 
ing entry in the two given matrices. Thus 



2 3 
-1 0 



2 3 
-1 0 



4 6 
-2 0 



2(2) 2(3) 
2(-l) 2(0) 



Likewise, for three equal matrices we have 



2 3 
-1 0 



2 3 
-1 0 



6 9 
-3 0 



3(2) 3(3) 
3(-L) 3(0) 



Each of the above sums may be considered to be the product of a number and 
a matrix « We write 
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2 
-L 

2 
-L 



4 6 

-2 0 

6 9 

-3 0 



def±i:' tit. crlx (I/2)A, is clearly satisfied by the matrix each of vhoss 
entxlt * iexac::ly 1/2 the corresponding entry of A; the eqxiation 

■|a -^A + ^ - A, 

defii\ 4 tm issrrix (1/3)A, is clearly satisfied by the matrix each of whose 
entri - \^ac2£tly 1/3 the corresponding entry of A. 

: .ese cmsiderations lead us to make the following general definition^ 



X^, ndtxon 1—6 . The product cA » Ac of a member c and an m nc n 

matrix is the m x n matrix B such that the entry b,. in the i— th 

Ij 

row and j^t2i column of B is equal to the product ca^j of the number c 
and tii£ ^ntxy a^j in the i—th row and j— th column of A. 



Thts?, 



[""ij] mxn " [^ij] mxn ^ [''^ij] mXn' 



For exiijniv:^ 



^1 








caj^2 












^=^22 


• 


«31 


«32 




^=^31 


^=^32 





Kote ttet here we Jiave defined the product of a matrix by a number , not 
the product o:f two matrices. It is possible also to define the product of two 
matrices; th±s will be done in Section 1-^7. 

■lim^ we may state the following theorem about products of matrices by 
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Theorem 1-^ . If A and B are m X n matxt^s?. ^nd x astd y are 
mnxnberSy then 

(a) x(yA) « (3cy)A, 

(b) (x+ y)A »« xA + 

(c) (^L)A « - A, 

(d) x(A + B) « xA + 23b 

(e) a 0 , 

(f) OA « 0 . 

Part (e) states that the product of a number ansr. thie zero matarix is the 

; zero matrix, and part (f) states that the product o± rhi^ zero nanhsrr and any 
matrix Is the zero matrix. 

Proof of Theorem (d) . The entry In the 1— th row and j— th column of 
the matrix A + B Is a^j + b^^ . The entry In the 1-th row and th column 
of matrix x(A + B) is therefore, „by_definiti^^^^ x(a^j + ^ij^' Now the entry 
In the 1— th row and j— th column of the matrix xA is ^^^j* tha± In the 
1— th row and j— th column of the matrix xB is xb^j . Thus the entry In the 
1— th row and j-th column of :he matrix xA + xB Is xa^^ + ^^^y Since the 
entries are numbers and, for all numbers, a(b + c) » ab + ac, we have 

x(a^j ^ b^j) « xa^j + xb^^, 

so that each entry in the matrix x(A + B) is the same as tiie corresponding 
entry of the matrix xA + xB. Hence, 



x(A + B) « xA + xB. 

The other parts of Theorem 1.4 may be proved in a stanxiar «ay. 

When we studied the laws governing the addition and subctractlon of 
matrices, we saw that they were parallel to the laws goverarng rsddltlon and 
subtraction in ordinary algebra. The situation when we ccnne to the multlpllca*- 
tion of matrices by numbers is rather similar, but not exactly laei same. The 
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various parts of Titiorsm 1-4 n;?ienble the basic algebraic laws for multiplica^ 
trion very closely, ua, many af r.he more complicated ordinary algebraic laws 
and procedures gover ^ng mui^zlpiication still remain correct for expressions 
involving the multl: . i cation of ms^crices by numbers. difference is that while 

the product ol a nimhsr by a number is a number, the i rrr^uct of a matrix 
■fcy a number is not number but a aatrix * 

We are now able zo soT;*^ 2Jcm£ matrix eTOations invr ving addition, 
:siib traction, and multiplicaricai bj a number- Let us lo^r. at an example. 

Suppose we want to soIts: the equation 



-2 X + 



12 3 
g 1 2 
0 0 1 



« 3X + 



10 0 
0 0 0 
0 0 1 



We first perforra the indicated multiplication by —2. in accordance with part 
(d) of the above theorem, to get 



-2X + 



-2 -A -6 
0-2-4 
0 0-2 



= 3X + 



10 0 
0 0 0 
0 0 1 



Then we add 2X to both sidss of the equation to obtain 



-2 -6 
0 -2 -A 
0 0-2 



= 3X + 2X + 



10 0 
0 0 0 
0 0 1 



Nest we use part (b) of the theorem to find that 3X -r 2X = 5X, so that 



-2-4-6 
0 -2 —A 
0 0-2 



5X + 



10 0 
0 0 0 

0 0 1 



Adding 



10 0 
0 0 0 
0 0 1 



to both. sides, we igtnd that 
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-3 ~4 --6 
0-2-4 
0 0 --3 



5X. 



J±:l.tiplying both sices .-of this last equation by 1/5, we see by part (a) c 
tsrre theorem that 



-3/5 -4/5 -6/5 
0 -2/5 -4/5 
0 0 --3/5 



Hlxta is our solution. 



Exercises 1--6 



i'or 



A = 



2 1 -3 
10 4j * 



3 0 5 
6 9-1 



and C 



5 -1 
7 8 



detrexmine the result of the following operations: 



(a) 2A - B + C, 

(b) 3A - 4B - 2Cy 



(c) 7A - 2(B - C), 

(d) 3(A - 2B + 3C) . 



For 



0 4 



B = 



:= 3 3 

3 0 5 
6 9-1 



and C = 



4 4 4 
5-10 
7 8-1 



dejDsimiire rfie result of the foilowing operations: 

(a>: 2A - 31+ C, (c) 7A-2(E-c), 

(i> 3A-m 9C (d) 3(A - 2E + 3C) .1 

3. Let A, 3^ and S, be the matrices of Exercise 2.' Solved the equation 



j-a + A) = 3(X + (2X + B)) + C, 
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giving iill thi: stepjj in detail, and justifying each step. 

4. Let A, B, axn: C be the rsatrices of Exercise 2. Solve the equation 

2(X + B) = 3(X ^ jiX A)) + C. 

5. Prove TheareEr /j.— 4 a). 

6 . Prove Ther reri 1—4 { b) , 

1-7, Maltiplicaticrr of Matrices 

Thus far, we have defined and studied the addition and subtraction of 
matrices and the ratuLtiplication of a matrix by a number. We sttll have not 
defined the pvodxxd: of two matrixes. Since the formal definition is somewhat 
complicated and may -sX first seesn odd, let ois look at a simple pcractical problem 
that will lessH Tis tc operate wit^c cto matrices in the way that we shall 
ultimately call mulriplication. 

In Secrion 1—1, the number c£ tubes and the number of speakers used in 
assembling XV sets c± three different models were specd.fied by ;e tflible: 

Model A Model B Model C 

Number of tubes 13 18 20 

Number of speakers ^ 3 4 

This array v^ \1 called the parts-t^r^-set :2atr±x. 

Si .'pjQSe orders warsi received in i^nuary for Z2 ssts of moc i h» 24 sets 
of moiseL^ B, 'Bn^ 12. sscs ofrmodel C; and in Ee&ruary 6 sets model A, 
12 cf:.rCTdel B and 9 of model C. We can arrsnge the informatic in the form 

of a .riasicrix:: 

February 

6 
12 

9 

m 

ttes will be called the sets-pe^^-month matrix. 
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January 

Model A 12 
Model B 24 
Model C 12 



To determine the number of cubes and speakers required in each of the 
months for these orders, it is cljaar t±at we must use both sets of information. 
For instance, to compute the number or tubes needed ±n January, we multiply each 
entry in the 1st row of the parts— per^et matrix by zhe corresponding entry in 
the 1st column of the sets— per^onth -matrix, and then add the three products. 
Thus, the number ox tubes required iv. January is 

13(12) + liBi(24) + 20(12) « 828. 

To compute the nCTiter of speakers ne^iided in January, wermultiply each entry 
in the 2nd row of rhe parts--per-set :i5strix by the corresponding entry in the 
1st column of the s^ets— per-month mat~x and then add the products. Thus, the 
number of speakers for January is 

2(12) + 3(:m) + 4(12) := IM^. 

For February, ficst we maltiply the entries from ±he Ls:t row of the parts--per— 
set matrix by the corresponding enrrieiS from the ,2nd cotimnr of the sets^er- 
month matrix and add to determrne tfas: ntniber of tabes; secondly, we multiply 
the entries from the.' 2nd mw ox the .psL^ZB ^eT-so^ ma:t:rix iiy the corresponding 
entr-'.es from the Itnd cslinsca of trhe 3etg-pg:t"mc:nttL ^jiatrjix snd add to determine 
the number of speakett^. Thus .zae nuaibers of rubes mixi spaakers for February 
are, respectively, 

13(6 ^-23:.}jr}' + 20(9) 474, 

and 

2(6) + 3(12) + 4(9) = 84. 

We can arrar^- dae four scts tn an array,, wh±ch we shall call the par ta- 
per— month nia t ■ 

January February 

Number o±. tubesr 828 474 

Number o±' speakers 144 84 
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Can we now represent our "operation" in equation form? Let us try: 



13 18 20 
2 3 4 

We have "multiplied'* the parts-per-set matrix by the sets-per-month matrix to 
get just what should be expected, the parts-pei>-month matrlxl 

Note that, in Equation (1), 828 equals the sum of the products cf the 
entries in the 1st row of the left-hand factor by the corresponding entries 
in the 1st column of the right-hand factor. Likewise, 474 equals the sum of 
the products of the entries In the 1st row of the left-hand factor by the 
corresponding entries in the 2nd column of the right— hand factor, and so on. 
Consider the "product" matrix 

'828 474' 
144 84 

in the symbolic form, 

^11 ^12 
^21 ^22] 

The subscripts indicate the row and column in which the entry appears; they 
also indicate the row and the column of the two factor matrices that are 
combined to get that entry. Thus, the entry a^^^ in the 2nd row and 1st 
column Is found by adding the products formed when the entries In the 2nd 
row of the left-hand factor are multiplied by the corresponding entries in the 
1st column of the right-hand factor. The most concise description of the 
process Is: " Multiply row by column ." 

The description, " Multiply row by column ," of the pattern in the foregoing 
simple practical problem serves as our guide in establishing the general rule 
for the multiplication of two matrices. Very simply the rule Is to multiply 
entries of a row by corresponding entries of a column and then add the products 
Thus, given two matrices A and B, to find the entry in the 1-th row and 
j-th column of the product matrix AB, multiply each entry in the 1-th row of 
the left-hand factor A by the corresponding entry in the j-th column of the 
right-hand factor B, and then add all the resulting terms. Since there must 
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12 6 
24 12 
12 9 



828 474 
144 84 



(1) 



be an entry in each row of the left— hand factor to match with each entry in • 
column of the right-hand factor, and conversely, it follows that the product 
is not defined unless the number of columns in the left-hand factor is equal 
to the number of rows in the right-hand factor. When the number of columns 
in the left-hand factor equals the number of rows in the right-hand factor, 
the matrices are conformable for multiplication , 

A diagram can aid understanding; see Figure 1—1* 




r 



Figure 1—1. Matrices A and B that are conformable 
for multiplication. The number of columns of A must 
be equal to the n^jmber of rows of B. Then the product 
AB has the same number of rows as A and the same 
number of colur^ifis as B. 



An entry in the product AB is found by multiplying each of the p 
entries in a row of A by the corresponding one of the p entries in the 
column of B and taking the sum; see Figure 1—2 . 
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j-th 
column 

of 
B 




Figure L-2. Determination of an entry in the product 
AB of matrices A and B that are conformable for 
multiplication. 



Thus, for' the matrices 



12 3 

4 5 6 
7 8 9 



and B 



1 0 

2 1 
4 1 



to form the product AB, we compute as follows: 
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Determining one entry of the product after another in this way, we finally 
obtain the cornicle te answer for the product AB: 





1 


2 


3 




1 


1 

0 




17 


5 




AB =» 


4 


5 


6 




2 


1 




38 


11 


• 




7 


8 


9 




4 


1 




59 


17 





(Chp^K each of t±:e entries of the answer yourself.) To get the answer, 18 
mul^lp 11 cations ;Hnd 12 additions of pairs of numbers are necessary. It might 
help to think of the first matrix in terms of its rows, 



and second in terms of its columns, 
TEaExr tSE- pxnduct appears as 



AB 



[^1 



Vl V2 
^2^1 ^2^2 
^3^1 ^3^2 



Eere, of course, (^^^ example) represents not a product but a sum of 

products:: 

R^C^ = (1)(1) + (2)(2) + (3)(4) = 17. 

The S3nmol RjC^ represents the entry in the 3rd row and 2nd column, and you 
can: decide for yourself what symbol represents the entry in the i— th row and 
j^tTTtPt column of the product. 

Here are some more examples: 
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(a) 



(b) 



(c) 



L 2 
3 L 
-1 2 



L 2 3 
4 0 



3 



1(1) + 2(4) 
3(1) + 1(4) 
-1(1) + 2(4) 



9 2 5 
7 6 10 
7 -2 -L 



H?.) + 2(0) 1(3) + 2(1) 
3(2) + 1(0) 3(3) + 1(1) 
-1(2) + 2(0) -1(3) + 2(1) 



[l(2) + 7(4) + 3(1)] - [33] , 



2(1) 2(7) 2(3) 
4(1) 4(7) 4(3) 
1(1) 1(7) 1(3) 



2 14 6 
4 28 12 
17 3 



Let us now proceed to define multiplication formally. 



Definition 1-7 . Let 

A = fa,.! and B ■» f b ., 1 

L"ij J mx p [ jkj px n 

be matrices of order m x p and p X ti, respectively. The product AB 
Is the matrix of order mi X n, of which the entry in the 1-th row and the 
j— th column Is the sum of the products formed by multiplying entries of the 
i-th row of A by corresponding entries of the j-th column of B. 

The definition of the product of two matrices can be expressed In terras 
of the " notation" for sums. Recall that, In the " ^ notation," we write 
the sum 

12 p 

of p ntimbers as 

s - i; X . 

jol -I 
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For example, 

J « 1^ + 2^ + 3^ + 4^ + 5^ « 55- 

Again, the familiar formula for the sum of the first p positive integers, 

1 +2 + ... +p -E(L_tli . 

can be expressed as 



In this notation, the sum 



Is expressed as 

P 



You will recognize this as the element in the third row and fourth column of 
the matrix AB* More generally, 

«ilhk + ^2''2k + Vpk 

is expressed as 

this sum Is the element in the i-th row and k-th column of AB. Thus we can 
express Definition 1-7 more compactly as follows: 
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Definition 1-7' . Let 



Xp 



and B » 



[^jk] pxn 



be matrices of order m x p and pxn, respectively. The product AB is 
the matrix of order m x n, given by 



[«ij].xp [V] 



px n 




mx n 



mx n 



Note that we have defined the product of two matrices only when the number 
»of columns of the left-hand factor is the same as the number of rows of the 
right-hand factor. Also note that the number of rows in the product is the 
same as the number of rows in the left-hand factor, and that the number of 
columns in the product is the same as the number of columns in the right-4iaud 
factor. 



1. Let 



Exercises 1—7 



2 0 3 
-14 9 



12 3 

4 5 6 

7 8 9 

10 1 



and D 



10-1 
0-11 
-1 0 11 



-1 -1 

2 2 
-3 -3 



State the orders of each of the following matrices: 



(a) AB, 

(b) DA, 

(c) AD, 

(d) CB, 



(e) BD, 

(f) D(AB), 

(g) (CB)(DA), 

(h) B(DA). 



2. Perform the following matrix multiplication, where possible: 
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(a) 

(b) 

(c) 

(d) 

(e) 

(f) 
3. Let X 



L 2 3 4 



L 

2 
3 
4 



I 

2 
3 
4 



12 3 4 



2 3 4 
-1 -2 0 



4 2 
6 I 
3 5 



4 2 
6 L 

3 5 



:] 

[ 2 3 4] 
[-L -2 Oj • 



I 2 
0 2 



2 3 4 
2-16 



4 2 
L 3 



« [2 -2 4] , Y « [o L 2] , 



and W « 



-L 

2 



Compute the following: 

(a) 5UX, 

(b) (5W)(3Y), 

(c) 5XU - (2X ~ Y)W> 



(d) XU + YW, 

(e) (U - W)(X + Y), 

(f) (X + Y)(U - W), 



4. Perform the following matrix multiplications: 



<^> [l 1] [2 ,0] ' 



(b) 



(c) 



10 0 
0 10 
0 0 1 



.\ "2 



h "^2 "^3 

^1 ^2 ^3 
t3 



2 0 0 
0 2 0 
0 0 2 
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3* 



(d) 



(e) 



(f) 



(g) 

5, For the matrices 



0 0 

0 

0 a, 
D 0 

D 0 



^2 ^3 



1 2 
0 0 
0 1 



0 10 
10 0 
0 0 1 



1 -1 
0 2 



1-10 
4 0 1 



and C = 



test the rule that (AB)C = A(BC) , 
6. Let 



A = 



12 3 
4 5 6 
7 8 9 



and B = 



1 0 -1 
-10 1 
-2 0 1 



Compute 



(a) 


AB, 


(f) 


A(B + B*^), 


(b) 


AB*^, 


(g) 


A(B - b''), 


(c) 


BB*^, 


(h) 


AB - AB*^, 


(d) 


(AB)B^ 


(i) 


AA - BB + bS 


(e) 


a(bb'^), 


(j) 


(AA)A. 



7. Let I denote the identity matrix of order 3 (see page 46) 



I « 



10 0 
0 10 
0 0 1 
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Let A and B be as in Exercise 6. Compute AI, BI, and B^I. Compute 
(AI)I and ((AI)I)I. 

8, Let 



[1 



-3 2 
2 I 



and B 



Find (AB)^ and B^ A^- 
9. For a certain manufacturing plant, the following information is given: 



Part 1 Part 'I Part 3 
Coat 2 3 5 



Subassembly 1 Subassembly 2 



Part 1 
Part 2 
Part 3 



4 
3 
7 



1 

5 
2 



Model 1 Model 2 ModieL '2i 



Subassembly 1 2 
Subassembly 2 3 



1 

4 







Day 1 


Day 2 


Day 3 


Model 


1 


7 


8 


8 


Model 


2 


3 


4 ' 


5 


Model 


3 


3 


5 


6 



Determine the parts-i>er-model matrix and the cost— peixlay matrix. 



1-8. Properties of Matrix Multiplication 

We have learned that insofar as only addition and subtraction are involved, 
the algebra of matrices is exactly like the ordinary algebra of numbers; see 
Section 1—5. At this moment, we might be concerned about multiplication since 
the definition seems a bit unusual. Is the algebra of matrices like the 
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ordinary algebra of numbers insofar as muLtiplication is concerned? 

Let us consider an example that will yield an answer to the foregoing 
question. Let 



A = 



0 ol 

1 oj 



and 



"o ll 
0 oJ 



If we compute AB, we find 



AB 



0 0 
0 1 



Now, if we reverse the order of the factors and compute BA, we fi 



BA = 



1 0 
0 0 



Thus AB and BA are different matricesl 
For anoti22x example, let 



A = 



1 2 
3 1 
-1 2 



and' B = 



12 3 
4 0 1 



Then 



AB 



1 2 
3 1 
-1 2 



12 3 
4 0 1 



1(1) + 2(4) 1(2) + 2(0) 1(3) + 2(1) 
3(1) + 1(4) 3(2) + 1(0) 3(3) + 1(1) 
-1(1) + 2(4) -1(2) + 2(0) -1(3) + 2(1) 



9 2 5 
7 6 10 
7 -2 -1 



while 



BA 



2 

4 0 



1(1) 
4(1) 



2(3) 
0(3) 



+ 3(-l) 
+ l(-l) 



1(2) 
4(2) 



2(1) 
0(1) 



3(2)1 
1(2) 



10 
10 



Again AB and BA are different matrices; they are not even of the same order'. 

Thus we have a first difference between matrix algebra and ordinary 
algebra, and a very significant difference it is indeed. When we multiply real 
numbers, wc can rearrange factors since the commutative law holds: For all 
X 6 R and y 6 R, we have xy = yx. When we multiply matrices, we have no 
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such law and we must consequently be careful to take the factors in the order 
given. We must consequently distinguish between the result of multiplying B 
on the right by A to get BA, and the result of multiplying B on the left 
by A to get AB. In the algebra of numbers, these two operations of "right 
multiplication" and "left multiplication" are the same; in matrix algebra, they 
are not necessarily the same. 

Let us explore some more differences'. Let 



A = 



3 1 

6 2 



and B 



-1 3 
3 --9 



Patently, A ^ 0 aaad B 9^ 0 . But if we compute AB, we obtain 



AB = 



3 1 
6 2 



-1 3 
3 ~9 



0 0 
0 0 



thus, we find AB = 0. Again, let 



A = 



12 0 
110 
-14 0 



and B - 



0 0 0 
060 
14 9 



Then 



AB 



12 0 
110 
-14 0 



0 0 

0 0 

1 4 



0 0 0 
0 0 0 
0 0 0 



The second major difference between ordinary algebra and matrix algebra is 
that the product of two matrices can be a zero matrix without either factor 
being a zero matrix. ^ 

The breakdown for matrix algebra of the law that xy = yx iand of the law 
that xy » 0 only if either x or y is zero causes additional differences. 

For instance, for real numbers we know that if ab « ac, and a 9* 0, 
then b = c. This property is called the cancellation law for multiplication t 



. Proof * We divided the proof into simple steps: 

(a) ab = ac, 

(b) ab — ac « 0, 

(c) a(b - c) « 0, 
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(d) b - c " 0, 

(e) b a c. 



For matrices, the above step from (c) to (d) fails and the proof is not 
valid* In fact, AB can be equal to AC, with A ^ 0, yet B «f< C. Thus, let 



12 0 
110 
-14 0 



Then 



12 3 

1 1 -1 

2 2 2 



and C 



12 3 
1 1 -1 
1 1 1 



AB 



3 4 1 

2 3 2 

3 2-7 



AC, 



and 



but 



A ^ 0, 



B ^ C. 



Let us consider another difference. We know that a real number a can 
have at most two square roots; that is, there are at most two roots of the ' 
equation xx = a . 

Proof > Again, we give the simple steps of the proof : 



(a 
(b 
(c 
(d 
(e 
(f 
(8 
(h 
(i 



Suppose that yy = a; then 

XX =^ yy, 

XX - yy « 0, 

(x — y)(x + y) « XX + (-yx + xy) - yy, 
yx » xy. 

From (d) and (e), (x - y)(x + y) » xx ~ yy . 
From (c) and (f), (x - y) (x + y) « 0 . 
Therefore, either x — y«0 or x+y = 0. 
Therefore, either x = y or x- — y. 



For matrices, statement (e) is false, and therefore the steps to (f) and 
(g) are invalid. Even if (g) were valid, the step from (g) to (h) fails. 
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Therefore, the foregoing proof is invalid if we try ro apply it to matrices. In 
fara:, it is false that a matrix can have at most two square roots: We have 



-1 



0 




1 


0 




1 


0 




1 




0 


1 




0 




* 


o' 




'l 


o' 




l 


0.' 




-1 




0 


--1 




0 


r 


* 


o' 




-1 


o"" 




' 1 


0-1 




1 




0 


1 




0 


1.: 








-1 


o" 










-1 




0 


-1 




c 


1 


• 



Thus the matrix 



1 0 
0 1 



has the four different square roots 



" 1 








0 


1 







-1 . 

A. 



-1 
0 1 



and L = 



-1 ol 
0 -ij 



There are morel Given any number x 0, we have 



0 X 




0 x" 




"i o' 


1/x 0 




I/x 0 




0 1 



By giving x any one of an infinity of different real values, we obtain an 
infinity of different square roots of the matrix I: 



0 2' 




"0 1/3" 


1/2 0 


> 


3 0 



0 



etc , 



Thus the very simple 2x2 matrix I has infinitely many distinct square 
roots'. You can see, then, that the fact that a real or complex number has at 
most two square roots is by no means trivial. 
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Exercises 1-8 



1. Let 



A « 



I and B « 1^ 



1 0 
-1 1 



Calculate: 



(a) AB, 

(b) BA, 

(c) (AB)A, 



(d) (BA)A, 

(e) (BA)B, 

(f) B(BA), 



(8) A(AB), 

(h) ((BA)A)B, 

(i) ((AB)A)B. 



2. Make the calculations of Exercise 1 for the matrices 



12 3 
4 5 6 
7 8 9 



and B « 



1 0 -1 
-10 1 
11-1 



3. Let A and B be as in* Exercise 2, and let 



10 0 
0 10 
0 0 1 



Calculate AI, lA, BI, IB, and (AI)B. 
4. Let 



1 -1 
0 2 



and B » 



1 0 
1 2 



Show by computation that ' 

(a) (A + B)(A + B) 9* A'^ + 2AB + B^, 

(b) (A B)(A - B) 9« A^ - B^, 
2 2 

where A and B denote AA and BB, respectively, 



5. Let 



10 0 
0 2 0 
0 0 3 



and B 
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2 3 2 3 3 2 3 

Calculate A, A, B, B, AB, AB, where A denotes A(AA) 

6. Find at least 8 square roots of the matrix 



I = 



10 0 
0 10 
0 0 1 



7 . Show that the matrix 



0 0 

1 0 



satisfies the equation A « 0. How many 2x2 matrices satisfying this 
equation can you find? 

8, Show that the matrix 



A = 



0 0 0 
10 0 
0 10 



satisfies the equation A = 0 • 



1-9. properties of Matrix Multiplication (Concluded) 

We have seen that two basic laws governing multiplication in the algebra 
of ordinary numbers break down when it comes to matrices. The commutative law 
and the cancellation law do not hold. At this point, you might fear a total 
collapse of all the other familiar laws. This is not the case. Aside from the 
two laws mentioned, and the fact that, as we shall see later, many matrices do 
not have multiplicative inverses (reciprocals), the other basic laws of 
ordinary algebra generally remain valid for matrices. The associative law 
holds for the multiplication of matrices and there are distributive laws that 
unite addition and multiplication. 

A few examples will aid us in understanding the laws. 

Let 



A « 



B = 



2 0 
1 1 



and C 5= 



-II] 
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Then 



A(BC) 



and 



(AB)C 



Thus, in this case, 



Again, 



A(B + C) 



and 



+ AC 



so chat also 



I 


0 




2 


0 




I 


I 




I 


L 


L 



-L 2 
3 L 



L 0 
I I 



-2 4 
2 3 



-2 4 
0 7 




A(BC) = (AB)C. 



L 0 

I I 

L 0 

I I 



i 2] = [5 4] ' 



L 0 

1 I 

2 0 

3 L 



2 0] 

^ [1 



-L 2 

3 



5 4j ' 



A(B + C) = AB + AC. 



(1) 



Since multiplication is not commutative, we cannot conclude from Equation 
(1) that the distributive principle is vAlid with the factor A on the right- 
hand side of B + C. Having illustrated the left-hand distributive law, we 
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now illustrate the right-hand distributive law with the following example: 
We have 



and 



(B + C)A « 



2 0 
1 1 



--1 2 
3 1 



1 0 
1 1 



1 2 
4 2 



1 0 
1 1 



3 2 
6 2 



BA + CA = 



2 0 

1 1 

2 0 
2 1 



"l 


o' 




-1 


2* 




1 


o' 


1 


1 


+ 


3 


1 




1 


1 



1 2 
4 1 



3 2 
6 2 



Thus, 

(B + C)A = BA + CA. 

You might note, in passing, that, in the above example, 

A(B + C) 5* (B + C)A. 

These properties of matrix multiplication can be expressed as theorems, 
as follows. 



Theorem 1-5. If 



A = [a. . 1 , B =» [b., 1 , and C = [c, ,1 

[ ij J mx p [ jkj px n' [ khj nx > 



then 



(AB)C = A(BC) 



Proof , (Optional.) We have 
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AB = 



-1 Ij Jk; 



mx n 



(AB)C 



BC 



A(BC) 



m xq 



pxq 



mxq 



Since the order of addition is arbitrary, we know that 



n / p 



mx q 



Hence, 



(AB)C = A(BC), 



Theorem 1-6. If 



[^j].xp' ^= [V]pxn' ^= [=jk] 



then A(B + C) = AB + BC. 

Proof . (Optional.) We have 



(B + C) 
A(B + C) 



= [V^'jkjpxn' 



m Xn 



P P 



j=l ^tl 



.mx n 
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Its 



m Xn 



raX n 



« AB + AC . 



Hence , 



A(B + C) « AB + AC. 



Theorem 1-7 . If 

hk]pxn' ['^Jklpxn' nxq ' 

then (B + C)A « BA + OA. 

Proof . The proof is similar to that of Theorem 1-6 and will be left as 
an exercise for the student. 

It should be noted that if the commutative law held for matrices, It would 
be unnecessary to prove Theorems 1-6 and 1-7 separately, since the two state- 
ments 



A(B + C) » AB + AC 

and 

(B + C)A = BA + CA 

would be equivalent. For matrices, however, the two statements are not equlva-* 
lent, even though both are true. The order of factors Is most Important, since 
statements like 

A(B + C) = AB + GA 

ind 

(B + C)A " AB + CA 

:.an be false for matrices. 
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Earlier we defined the zero matrix of order m x n and showed that It Is 
the Identity element for matrix addition: 



A + 0 « A, 



vhere A Is any matrix of order m X n. This zero matrix plays the same role 
in the multiplication of matrices as the number zero does in the multiplication 
of real i^umbers. For example, we have 



2 0 3 
1 1 4 



0 0 
0 0 
0 0 



0 0 
0 0 



Theorem L-8 . For any matrix 



we have 



^px n ^ [^ij] pxn' 



0 A 
mx p pxn 



0 and A 0 =0 

mx n p Xn nXq pXq 



The proof Is easy and is left to the student. 

Now we may be wondering if there is an Identity element for the multiplica- 
tion of matrices, namely a matrix that plays the same role as the number 1 does 
In the multiplication of real numbers. (For all real numbers a, la « a = al.) 
There Is such a matrix, called the unit matrix , or the identity matrix for 
multiplication, and denoted by the symbol I. The matrix namely. 



1 0 
0 1 



is called the unit matrix of order 2. The matrix 



^3 = 



10 0 
0 10 
0 0 1 



Is called the unit matrix of order 3. In general, the unit matrix of order 
n Is the square matrix [^ij] nx n ^"^^ ^^^^ ^ij ^ ^ * J 
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« 0 for all i j (i « 1,2, ...,n; j « L,2,,..,n), We now state the 
Important property of the unit matrix as a theorem. 

Theorem 1-9 . If A is an m x n matrix, then AI = A and I A » A. 

n m 

Proof. By definition, the entry in the i-th row and j-th column of the 

product AI is the sum a . . e + a.^e + ... + a. e Since e, « 0 for 
n 11 Ij i2 2j in nj kj 

all k j4 j, all terms but one in this sum are zero and drop out. We are 

left with ^ij^jj a^j* Thus the entry in the i-th row and j-th column of the 

product is the same as the corresponding entiry in A, Hence AI "A. The 

n 

equality I^A » A may be proved the same way. In most situations, it is not 
necessary to specify the order of the unit matrix since the order is inferred 
from the context. Thus, for 

I A « A » AI , 
m n' 

we write 

lA « A « AI. 



For example, we have 



and 



1 2 

3 4 
5 6 



1 0 
0 1 



1 2 
3 4 
5 6 



10 0 
0 1-0 
0 0 1 



1 2 

3 4 
5 6 



1 2 

3 4 
5 6 



Exercises 1-9 



Let 



0 1 

1 0 



0 1 
-1 0 



and 



[ 0 0 
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Test the formulas 



A(B + C) « AB + AC, 
(B + C)A a BA + CA, 
A(B + C) « AB + CA, 
A(B + C) « BA + CA. 

Which are correct, and which are false? 
Let 



A « 



0 0 

1 0 



and B = 



1 0 
0 0 



Show that AB ^ 0, but BA = 0. 

Show that for all matrices A and B of the form 



A « 



a b 
-b a 



and B 



c d 
— d c 



we have 



AB « BA. 



Illustrate by assigning numerical values to a, b, c, and 
a, b, c, and d integ;ers. 

Find the value of x for which the following product is I: 



2 


0 


7 




0 


1 


0 




1 


2 


1 

- 





-X — I4x 7x 
0 10 
X 4x •-•2x 



For the matrices 



0 0 0 
10 0 
0 10 



B = 



0 0 0 
0 0 0 
10 0 



and 



C = 



show that AB =* BA, that AC CA, and that BC « CB. 



Sho\' that the matrix 
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A 



0 10 
0 0 1 
10 0 



satisfies the equation A « 1. Find at least one more solution of this 
equation. 



7. Show that for all matrices A of the form 



ab b 
2 

—a — ab 



we have 



A^ = 0, 



Illustrate by assigning numerical values to a and b, 
8. Let 



D = 



0 1 

1 0 



E = 



0 -1 

1 0 



and F 



1 0 
0 -1 



Compute the following: 

(a) DE, 

(b) DF, 

(c) EF, 



(d) ED, 
FD, 
(f) FE. 



If AB » — BA, A and B are said to be anticommutative . l^hat con- 
clusions can be drawn concerning D, E, and F? 

" 3 l" 
-1 2 



9. Show that the matrix A « 



. A - 5A + 71 = 0. 
10. Explain why, in matrix algebra, 



is a solution of the equation 



(A + B) (A - B) ^ A^ - B^ 



except in special cases. Can you devise two matrices A and B that 
will illustrate the inequality? Can you devise two matrices A and B 
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that will illustrate the special case? (Hint: Use square matrices of 
order 2.) 

11. Show that if V and W are n X 1 column vectors, then 

12. Prove that (AB) = B A , assuming that A and B are conformable for 
multiplication • 

13. Using notation, prove the right-4iand distributive law (Theorem 1.7). 
1—10 . Summary 

In this introductory chapter we have defined several operations on 
matrices, such as addition and multiplication. These operations differ from 
those of elementary algebra in that they cannot always be performed. Thus, 
we do not add a 2x2 matrix to a 3x3 matrix; again, though a 4x3 
matrix and a 3x4 matrix can be multiplied together, the product is neither 
4x3 nor 3x4. More importantly, the commutative law for multiplication 
and the cancellation law do not hold. 

There is a third significant difference that we shall explore more fully 
in later chapters but shall introduce now. Recall that the operation of 
subtraction was cloaely associated with that of addition. In order to solve 
equations of the form 

A + X =3 B, 

it is convenient to employ the additive inverse, or negative, ~A. Thus, if 
the foregoing equation holds, then we have 

A + X + (---A) « B + (-A), 
X + A + (-A) B + (~A), 
X + 0 « B - A, 
X « B - A. 



As you know, every matrix A has a negative -A. Now "division" is closely 

61 

[sec. 1-9] 



51 



associated with multiplication in a parallel manner. In order to solve 
equations of the form 

AX B, 

we would analogously employ multiplicative inverse (or reciprocal), which is 
denoted by the symbol A The defining property is A = I = AA""^ • This 

enables us to solve equations of the form 

AX B. 

Thus if the foregoing equation holds, and if A has a multiplicative inverse 

•^1 ' 
A , then 

a""'^(ax) = a'^^b, 
(a"'^A)X a'^^b, 
IX = A*"^B, 

X « a"^b. 



Now,, many matrices other than the zero matrix 0 do not possess multiplicative 
inverses; for instance. 




are matrices of this sort. This fact constitutes a very significant difference 
between the algebra of matrices and the algebra of real numbers. In the next 
tw chapters, we shall explore the problem of matrix inversion in depth. 

Before closing this chapter, we should note that matrices arising in 
scientific and industrial applications are much larger and their entries much 
more complicated than has been the case in this chapter. As you can imagine, ' 
the computations involved when dealing with larger matrices (of order 10 or 
more), which is usual in applied~work, are so extensive as to discourage their 
use in hand computations. Fortunately, the recent development of higL -speed 
electronic computers has largely overcome this difficulty and thereby has made 
it more feasible to apiply matrix methods in many areas of endeavor, 
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Chapter 2 
THE ALGEBRA OF 2x2 MATRICES 



2—1, Introduction 

In Chapter 1, we considered the elementary operations of addition and 
multiplication for rectangular matrices. This algebra is similar in many 
respects to tl.e algebra of real numbers, although there are important differ- 
ences. Specifically, we noted that the commutative law and the cancellation 
law do not hold for matrix multiplication, and that division is not always 
possible . 

With matrices, the whole problem of division is a very complex one; it is 
centered around the existence of a multiplicative inverse. Let us ask a 
question: If you were given the matrix equation 



1 


2 


3 


4 




'^11 






1 


0 


2 


o' 


8 


9 


0 


--1 






0 


1 


0 


2 


4 


5 


6 


5 










2 


0 


1 


0 


0 


4 


2 


0 




hi 






0 


2 


0 


1 



could you solve it for the unknown 4x4 matrix X? Do not be dismayed 
if your answer is "No." Eventually, we shall learn methods of solving this 
equation, but the problem is complex and lengthy. In order to understand 
this problem in depth and at the same time comprehend the full significance 
of the algebra we have developed so far, we shall largely confine our attention 
in this chapter to a special subset of the set of all rectangular matrices; 
namely, we shall consider the set of 2 x* 2 square matrices . 

When one stands back and takes a broad view of the many different kinds of 
numbers that have been studied, one sees recurring patterns. For instance, let 
us look at the rational numbers for a moment. Here is a set of numbers that ue 
can add and multiply. This statement is so simple that we almost take it for 
granted. But it is not true of all sets, so let us give a name to the notion 
that is involved. 

Definition 2-1 « A set S is said to be closed under an operation R on 
a first member a of S and a second member b of S if 

(i) the operation can be performed on each a and b of S, 
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(ii) for each a and b of S, the result of the operation is a 
member of S • 

For example, the set of positive integers is not closed under the operation 
of division since for some positive integers a and b the ratio a/b is not 
a positive integer; neither is the set of rational numbers closed under division, 
since the operation cannot be performed if b =0; but the set of positive 
rational numbers is closed under division since the quotient of two positive 
rational numbers is a positive rational number. 

Under addition and multiplication, the set of rational numbers satisfies 
the following postulates: 

The set is closed under addition. 

Addition is commutative. - 

Addition is associative. 

There is an identity member (0) for addition. 

There is an additive inverse member —a for each member a. 



The set is closed under multiplication. 
Multiplication is commutative. 
Multiplication is associative. 



There is an identity member (1) for multiplication 

"lere is a multi 
other than 0 



There is a multiplicative inverse member a ^ for each member a, 



Multiplication is distributive over addition. 

Since there exists a rational multiplicative inverse for each rational number 
except 0, division (except by 0) is always possible in the algebra of 
rational numbers. In other words, all equations of the form 

ax = b, 

where a and b are rational numbers and a ^ 0, can be solved for x in 
the algebra of rational numbers. For example: In order to solve the equation 
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-3 X => J . 

we multiply both sides of the equation by - 3/2, the multiplicative inverse 
of - 2/3. Thus we obtain 

or 

3 

which is a rational number. 

The foregoing set of postulates is satisfied also by the set of real 
numbers. Any set that satisfies such a set of postulates is called a field . 
Both the set of real numbers and the set of rationals, which is a subset of 
the set of real numbers, are fields under addition and multiplication. There 
are many systems that have this same pattern. In each of these systems, 
division (except by 0) is always possible. 

Now our immediate concern is to explore the problem of division in the set 
of matrices. There is no blanket answer that can readily be reached, although 
there is an answer that we can find by proceeding stepwise. At first, let us 
limit our discussion to the set of 2x2 matrices. We do this not only to 
consider division in a smaller domain, but also to study in detail the algebra 
associated with this subset. A more general problem of matrix division will be 
considered in Chapter 3. 

Exercise's 2—1 

1. Determine which of the following sets are closed under the stated 
operation: 

(a) the set of integers under addition, 

(b) the set of even numbers under multiplication, 

(c) the set {l) under multiplication, 

(d) the set of positive irrational numbers under division, 
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(e) the set of integers under the operation of squaring, 

(f) the set of numbers A = {x: x > 3) under addition. 

Determine which of the following statements are true, and state which of 
the indicated operations are commutative: 

(a) 2 - 3 « 3 - 2, 

(b) 4 4 2 = 2 4, 

(c) 3+2=2+3, 

(d) ya + yt) = yb + ya, a and b positive, 

(e) a - b = b - a, a and b real, 

(f) pq » qp, p and q real, 

(g) + 2 - 2 + y=r. 

Determine which of the following operations -f, defined for positive 
integers in terms of addition and multiplication, are commutative: 

(a) X -t y = X + 2y (for example, 2-i3 = 2+ 6 = 8), 

(b) X -t y = 2xy, 

(c) X -t y = 2x + 2y, 

2 

(d) X -t y = xy , 

(e) X * y = x^, 

(f) x-ty = x+ y+ l- 

Determine which of the following operations *, defined for positive 
integers in terms of addition and multiplication, are associative: 

(a) X * y = X + 2y (for example, (2 * 3) * 4 = 8 * 4 = 16) , 

(b) X * y = X + y, 

2 

(c) X * y = xy , 

(d) X * y = X, 

(e) X * y = -/y^y 

(f) X * y = xy + 1 . 

Determine whether the operation * is distributive over the operation 
that is, determine whether x * (y * z) (x * y) ^ (x * z) and 
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(y z) >v X = (y * x) (z * x) , where the operations and * are 
defined for positive integers in terms of addition and multiplication of 
real numbers: 

(a) x-iy^x+y, x*y = xy; 

(b) X -i y = 2x + 2y, x * y = | xy; 

(c) x-iy» X y + 1, x*y = xy. 

Why is the answer the same in each case for left— hand distribution as it 
is for right— hand distribution? 

6. In each of the following examples, determine if the specified set, under 
addition and multiplication, constitutes a field: 



(a) 


the set 


of 


all 


positi e numbers. 






(b) 


the set 


of 


all 


rational numbers. 






(c) 


the set 
a and 


of 
b 


all 
are 


real numbers of thd form a 
integers , 


+ by2. 


where 


(d) 


the set 
a and 


of 
b 


all 
are 


complex numbers of the form 
real numbers and i -^/—l 


a + bi. 


where 



2-2, The Ring of 2x2 Matrices 

Since we are confining our attention to the subset of 2x2 matrices, 
it is very convenient to have a symbol for this subset. We let M' denote the 
set of all 2x2 matrices. If A is a member, or element, of this set, we 
express this membership symbolically by A € M. Since all elements of M are 
matrices, our general definitions of addition and multiplication hold for this 
subset . 

The set M is not a field, as defined in Section 2—1, since M does not 
have all the properties of a field; for example, you saw in Chapter i that 
multiplication is not commutative in M. Thus, for 



0 0 

1 0 



and B « 



0 1 
0 0 



we have 



[sec. 2-1] 

• ' 67 



58 



AB 



0 0 
0 1 



while BA ■= 



0 0 J 



Let us now consider a less restrictive sort of inatheinatlcaL system known 
as a ring ; this name Is usually attributed to David Hllbert (1862-1943). 

Definition 2-2 , A ring Is a set with two operations, called addition and 
multiplication, that possesses the following properties under addition and 
multlplicatrion: 

Tie set Is closed under addition. 

Addition Is commutative. 

Addition is associative. 

There Is an Identity element for addition. 
There Is an additive inverse for each element. 

... 

The set is closed under multiplication. 
Multiplication is associative. 



Multiplication is distributive over addition. 

Does the set M satisfy these properties? It seems clear that it does, 
but the answer is not quite obvious. Consider the set of all real numbers. 
This set Is a field because there exists, among other things, an additive 
Inverse for each number In this set. Now the poBitive Integers are a subset 
of the real numbers. Does this subset contain an additive Inverse for each 
element? Since we do not have negative Integers In the set under consideration 
the answer Is "No"; therefore, the set of positive Integers Is not a field. 
Thus a subset does not necessarily have the same properties as the complete 
set . 

To be certain that the set M Is a ring, we must systematically make sure 
that each ring criterion Is satisfied. Fcr the most part, our proof will be a 
reiteration of the material In Chapter 1, since the general properties of 
matrices will be valid for the subset M of 2x2 matrices. The sum of two 
2x2 matrices Is a 2x2 matrix; that Is, the set Is closed under addition. 
For example, 68 
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0 1 
2 3 



-1 2 
3 -1 



-1 3 
5 2 



The general proofs of comntutativity and associativity are valid. The unit 
matrix is 



1 0 
0 1 



the zero matrix is 



0 0 
0 0 



and the additive inverse of the matrix 



a b 
c d 



is 



—a — b 
— c — d 



When we consider the multiplication of 2x2 matrices, we must first verify 
that the product is an element of this set, namely a 2x2 matrix. Recall 
that the number of rows in the product is equal to the number of rows in the 
left-hand factor, and the number of columns is equal to the number of columns 
in the right-hand factor. Thus, Che product of two elements of the set M 
must be an element of this set, namely a 2x2 matrix; accordingly, the set 
is closed under multiplication. For example, 



1 2 
3 A 



1 -1 
-1 1 



The general proof of associativity is valid for elements of M, since it 'is valid 
for rectangular matrices. Also, both of the distributive laws hold for elements 
of M by the same reasoning. For example, to illustrate the associative law 
for multiplication, we have r\ g\ 
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L 2 
3 4 



] [-1 i) [; I] - in 1] [; I] - [\ 1] . 



and also 



and to illustrate the left-hand distributive law, we have 



[3 1] 




[; 0]) - [\ 



2 

3 4 



1 1 

0 1 



1 3 
3 7 



and also 



^^][-ll]^[3j][?o] 



-1 1 
-1 1 



2 2 
4 6 



] = [3 



3 

3 7 



Since we have checked that each of -the ring postulates is fulfilled, we 
have shown that the set K of 2 x 2 matrices is a ring under addition and 
multiplication. We state this result formally as a theorem. 



Theorem 2-1 . The set M of 2x2 matrices is a ring under addition 
and multiplication. 

Since the list of defining properties for a field contains all the defining 
properties for a ring, it follows that every field is a ring. But the converse 
statement is not true; for example, we now know that the set M of 2x2 
matrices is a ring but not a field. The set M has one more of the field 
properties, namely there is an identity element 

for multiplication in M; that is, for each A 6 M we have 

lA - A - AI. 

Thus the set M is a ring with an identity element . 
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At this time, we should verify that the commutative law for multiplication 
and the cancellation law are not valid In M by giving counterexamples. Thus 
we have 



1 2 
3 4 



[-; 1] 



"1 1 
"1 1 



but 



1 -1 
-1 1 



1 2 
3 4 



-2 -2 

2 2 



so the coimnutatlve law for multiplication does not hold. Also, 



fo ol [o ol 

[1 Oj [2 o] 



0 0 

1 0 



0 0 
3 0 



so that the cancellation law does not hold. 



Exercises 2—2 



1. Determine If the set of all Integers Is a ring under the operations of 
addition and multiplication. 

2. Determine which of the following sets are rings under addition and 
multiplication: 

(a) the set of numbers of the form a + b y/l^ where a and b 
are integers; 

(b) the set of four fourth roots of unity, namely, +1, —1, 1, 
aind -1 ; 



(c) the set of numbers a/2, where a is an integer. 



3, Determine if the set of all matrices of the form 



, with a £ R, 



forms a ring under addition and multiplication as defined for matrices. 



4. Determine if the set of all matrices of the form 



with a € R, 

forms a ring under addition and multiplication as defined for matrices. 



a 0 
0 a2 
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2-3 > The Uniqueness of the Multiplicative Inverse 

Once again we turn our attention to the problem of matrix division. As we 
have seen, this problem arises v?hen we seek to solve a matrix equation of the 
form 



AX 



Let us look at a parallel equation concerning real numbers, 



ax = c . 

Each nonzero number a has a reciprocal ! 1/a, which is often designated a ^. 
Its defining property is aa""^ = 1. Since multiplication of real numbers is 
commutative, it follows that a""^a = 1. Hence if a is a nonzero number, 
Chen there is a number b, called the multiplicative inverse of a, such that 



ab = 1 = ba (b « a 

Given an equation ax c, where a ^ 0, the multiplicative inverse b enables 
us to find a solution for x; tiius, 

b(ax:) = be, 
(ba)x = be, 
Lx = be, 
X = be. ■ 

Now our question concerning division by matrices can be put in another way. If 
A € M, is there a B G M for which the equation 

AB = I = BA 

is satisfied? We shall employ the more suggestive notation A ^ for the in- 
verse, so that our question can be restated: Is there an element A € M for 
which the equation 

AA'"^ = I « a'^^a 
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is satisfied? Since we shall often be using this defining property, let us 
state it formally as a definition. 

Definition 2-3 . If A e M, then an element a'"^ of M is an inverse of 
A provided 

AA*"^ = I =» a""^A. 

If there were an element B corresponding to each element A € M such 

that 

AB = I = BA, 

then we could solve all equations of the form 

AX = C, 

since we would have 

B(AX) « BC, 
(BA)X BC, 
IX « BC, 
X « BC, 

and clearly this value satisfies the original equation. 

From the fact that there is a multiplicative inverse for every real number 
except zero, we might wrongly infer a parallel conclusion for matrices. As 
stated in Chapter 1, not all matrices have inverses. Our knowledge that 0 
has no inverse suggests that the zero matrix 0 has no inverse. This is true, 
since we have 

OX « 0 

for «ll X G M, so that there cannot be any X e M such that 

OX « I. 
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But there Is a more fundamental difficulty than this. Let us take the 
nonzero matrix 



0 0 J 



and try to solve the equation 



AX « I, 



for X € 



If we let 



p q 

r 8 



then ve find that 



AX 



" L 


0* 




p 




S3 


P 


q 


0 


0 




r 


s 




0 


0 



Hence, no matter what entries we take for X, we cannot have 



AX " I 



since the entry in the Lower right-hand comer of AX is zero, and the entry 
in the Lower right— hand comer of I is L, 

At this pointy you might be thinking that no matrix has an inverse* Do not 
move too fasti Note that 

I-I « I « I-I. 

This means that I is its own inverse, just as L is its own inverse among the 
reaL numbers • 

Also, Let us note that 



'l/2 0 ^ 




"2 o' 




0 L/2 




0 2 





Thus the matrix 



II] 
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' 2 0* 




'l/2 0 




0 2 




0 L/2 


* 
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2 0 
0 2 



has the Inverse 



-1 



1/2 0 ] 
0 1/2 J - 



Consequently, the equation 



may be solved by nultlplylng both sides by A ^, thus: 



[1/2 0 ] [2 Ol 
[ 0 l/2j[0 2j ^ 



1/2 0 
0 1/2 



1 2 
3 4 



•l ol [1/2 ll 

0 ij ^ [ 3/2 2j • 



1/2 1 
3/2 2 



This Is a specific illustration of a general pattern. Let a be any 
nonzero number. Now , 



I = 11 

-1, 
= aa I 



aa'^\l)(I) 



Since the multiplication of real numbers and matrices is associative and 
commutative, it follows that for all real numbers a and b, and all 2 X 
matrices X and Y, we have 

abXY = (aX)(bY). 



In particular, then, 



I « (aDCa"^^!). 



[boo^ 2-3] 



66 

Since aa ^ » a we can also state that. 



This result enables us to enumerate a large number of matrices and their in- 
verses. Thus, let A a al; then a""^ a""^!. For example, if a » 3 then 



3 0 
0 3 



and 



1/3 0 
0 1/3 



If a « 0 .2, then 



0.2 0 
0 0.2 



and 



-1 



3] 



At least we know that there are a great many matrices A with the property 
that there is a corresponding matrix B such that 

AB « I « BA. 



Before turning to the problem of finding those matrices that have inverses, 
let us show first that if a matrix has an inverse, it has only one inverse; 
that is, this inverse is unique. For instance, in the example directly above, 
we saw that 



5 0 
0 5 



if 



0.2 0 
0 0.2 



We wish to show that there is no other inverse. Suppose that we have elements 
A, B, and C of M such that 

AB = I « BA, 



and 



AC « I - CA; 

that is, A has an inverse B, and A also has an inverse C. Multiply the 
first of these two equations on the left by C. Then 
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C(AB) = CI, 
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(CA)B « C , 

since multiplication is associative and I is the unit matrix. But CA « I. 
Hence 

IB « C, 

or 

B = C. 

This result is so important that we call it a theorem and state it formally: 

.-1 .-1 



Theorem 2- 



-2. If A e M and if there exists A , A € M, t . . fi that 



AA^«I«A^A, 



then a""^ is unique; that is, there is no other solution X of the equations 

AX « I « XA. 

Now we can readily show that A is the inverse of A ^ if we know that 
A^J is the inverse of A. This may seem a bit trivial, but it is important 
enough to state formally and prove. 

Theorem 2-3 . If A € M and if A has an inverse A ^, then A ^ also 
has an inver9e; namely, A is the inverse of A . 

Proof , Since A"^^ is the inverse of A, this means, by definition, that 

AA''^ a I b a""^ a. 
However, the statement of equality can be given in reverse order: 
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A^^ A « I =« AA ^ . 



This, by de'finitlon, is the statement that A is the inverse of A ^, 



Exercises 2—3 



1. Show that each of the following matrices does not have a multiplicative 
inverse: 



(a) 



To ll 
[o oj ' 



(b) 



1 0 
1 0 



(c) 



1 1 
1 1 



id) 



0 
-3 



2. Which of the following pairs of elements of M are inverses of one 
another? 



(a) 



1 0 
0 1 



and 



(b) 



(c) 



-1 
-3 

4 
1 



and 



and 



3 -1 

2 -1 

1 

-6 2 



(e) 



-5 7 

L 0 2 

"a b 
c d 



and 



and 



p 1 

[l 0 

[ d -b 

— c a 



3. Use the argument in the text to show that, since 



1 ^2 
-3 6 



2 2 
1 1 



0 f 



neither of the matrices in the product is invertible (has an inverse), 
2 

4, Show that if a + be « 0, then 



a b 
c ~a 



« 0 , 
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and hence that in this case 

[a b" 

has no inverse . 

5. Show that if A e M, B g M, B 0, and AB « 0 , then A cannot hc,ve 

an inverse. Can B have an inverse? 

2 

6. Show that if A e M, and A - 4A 0, then either A « 41 or A has 
no inverse. (Hint: Factor the left-hand side and note Exercise 5.) 

7. Show that if A e M, B g M, C g M, and AB = 1 = CA, then B « C. 

8. Show by direct computation that the matrix 

^ " [ 0 3 

satisfies Che equation 

- 2A - 31 = 0, 



Chat is, 



-1 2 
0 3 



- 2 



0 3j 



1 0 
0 1 



0 0 
0 0 



9 . The matrices 



and 



5 3 
2 1 



are lnveiv.^es of one another. Are their squares also inverses? Their 
transposes ? 

10. Since 



A = A*A, 

.3 



2 2 
A-A A .A, 

A.A-^ « a'.A"^, 
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and so on, we can readily demonstrate that ^ is the inverse of A if 
A^ = I. Using this information, compute the inverse of each of the follow- 
ing matrices: 



(a) 
(b) 
(c) 



11. Let 



0 -1 

-1 0 

0 -1 

1 0 



cos 6 sin 8 
-sin e cos e 



2 3 
and compute B and B if 9 = 120°, 



12. If 



3 -A 
1 -1 



verify that 

A^ - 2A + I = 0. 

Does the transpose of A also satisfy this same equation? 
13, Prove that if A e M, if p, q, and r are numbers, and if 



pA + qA + rl 



with r 0, then A has an inverse. (Hint: Subtract the "constant 
term" rl from both members of the equation and factor the remaining terms 
in the left member.) 

14. Prove by direct substitution that if 




p q 

r s 
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then 

- (p + s)X + (ps - qr)I ~ 0. 

Show that X has an inverse if and only if ps - qr 0. (Hint: Use 
Exercise 13.) 

2 

15. Use the result of Exercise 14 to show that if X = 0 then ps — qr = 0 
and p + s =3 0 . (Perhaps you may have to consider several cases in the 
proof.) 



2—4. The Inverse of a Matrix of Order 2 

At this point, we have proved that the inverse of a 2x2 matrix, if it 
exists, is unique. Also, we know that there are some 2x2 matrices that 
have inverses and there are some that do not have inverses. But we have not 
yet developed any general methods of attacking the problem. Certainly our 
algebra will lack power unless general methods are developed. We are in a 
situation similar to that in which a student finds himself when he has not yet 
learned the quadratic formula or the general procedure for deriving it. He can 
find the roots of many quadratic equations by trial, but he has no means for 
solving all these equations. 

It is our purpose now to develop a general method of determining the 
inverse of a 2x2 matrix when it exists. We shall begin with a matrix whose 
entries are specific numbers and then duplicate our procedure with a matrix 
whose entries are more general. To start, we shall consider the matrix 




3 -1 
5 -2 



and determine whether there is an inverse B such that AB =» I = BA. If we let 



B = 



p q 

r s 



then 



3 


-l' 




p q 




r-l 


o' 


_5 


-1 




r s 




0 


i 



or 
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3p -r 3q -s 
5p - 2r 5q - 2s 



1 0 
0 1 



If these two matrices are equal, the respective entries are equal. Thus we 
have four equations, 

3p - r =» 1, (1) 3q s = 0, (3) 

5p - 2r « 0, (2) 5q - 2s « 1, (4) 

After multiplying Equation (1) by 2, we subtract Equation (2) from Ec^uation 
(1) and obtain 



p =» 2, 



By substituting this value of p in either Equation (1) or Equation (2), we 
obtain 



r «= 5. 



Equations (3) and (4) can be solved similarly, yielding 



q = — 1 and s = — 3 . 



Now if we substitute these values for p, q, r, and s, we obtain 



[3 :3] • 



To demonstrate that B is the inverse of A, we must show that AB " I ■» BA. 
This is easy: 



3 -1 
5 -2 



2 -il 
.5 -3 J 



[I I] - [5 5] [5 :2] 



Using the notation for the inverse of a matrix introduced earlier, we may write 

-1 



3 -1 
5 -2 



2 -1 
5 -3 
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In our next step, we shall follow the same pattern as above; but now we 
shall use a general notation for our matrix A. Instead of having specific 
real numbers for entries, we let 



a b 
c d 



As before, we represent the inverse, if it exists, as 



p q 

r s 



Assuming AB = I, we have 



'a b" 




p q 




"ap 


+ br 


c d 




r s 




cp 


+ dr 



+ bs 


a 


' 1 


o' 


+ ds 




0 


1 



This matrix equation may be written as four equations. 



ap + br = 1, (5) 
cp + dr = 0, (6) 



aq + bs = 0, (7) 
cq + ds = 1. (8) 



Since we wish to find values for p, q, r, and s, in terms of the real 
numbers a, b, c, and d, we multiply Equation (5) by d, Equation (6) by 
and then subtract. We obtain 

adp — bcp a d, 

or 

(ad — bc)p = d , 

Repeating this process, using appropriate pairs of equations, we obtain 

(ad - bc)q = -h, (ad - bc)r = -c, (ad - bc)s « a. 

Should it happen that ad — be = 0, then it follows from the four equa- 
tions, above, that a«b=c«d=0, so that A = 0. 

We have seen in Section 2.3 that the zero matrix does not have an inverse 

[sec, 2"^] 

83 



7^^ 

Hen^^ if ^id — be w 0 ve have a contradiction of the assumption that the matrix 
A an i-'^verse b* In other words, if A has an inverse, then ad - be ^ 0- 

TempC'^^rily, let us denote the number ad - be by h. Now if h ^ 0, we 
may ^rite 



P « K , q 



b 

h * 



r « — f: 



c 

h * 



s «^ 



Sul7^^ituti^6 these values appropriately in B, we obtain 



B = 



d b 

h h 

c a 

h h 



d -b 
-c a 



In c^tder show that this matrix is the inverse of A, we check: 



AB « 



d 
h 

£ 

h 



ad-bc 
h 

cd— cd 



-ab+ab 



h 

-be -fad 



1 ol 

0 ij 



We ^^st aJ-So make sure that BA =" I> thus: 



BA 



d 
h 
e 
h 



a b 

e d 



bd-bd 




1 0 
0 1 



Xb^ 5aet '^^t the relationship BA = I follows from the relationship AB » I 
is quite significant. While the definition of the inverse ' demands the existence 
an^ ^qual^-^ of vhat are called left and right inverses, we have shown that for 
2/2 ma^^^ices the existence of one implies the existence of the other and that 
t£ ^Key e^^i^t then they are, in fact, the same. Since the multiplication of 
^Ctices not generally commutative, we might have expected otherwise. 
We gl^^ll Stat, our result formally as a theorem. 



TheoJSSL^I^. If the matrix 



a b 
c d 



has an inverse, then h «= ad - be 0 
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a b 

c d 



d b 

h h 

c a 

h h 



Also, we state the converse of this result concerning h: 

a b1 



Theorem 2-5 , If h « ad - be 0, then the matrix 
inverse, which is 



c d> 



has an 



*d 


b 


h 


^ h 


c 


a 


h 


h 



Proof . Direct multiplication shows that 



a b 

c d 



d 


b 






h 


" h 




' 1 o' 


c 


a 




0 1 


h 


h 







d 
h 
c 
h 



'a bl 

c dj 



Xffu will note that Theorems 2,4 and 2,5 together state that the matrix 
a b 

^ ^ has an inverse if and only If h o . That is, the condition h 0 
is botn necessaty and sufficiient for the matrix to have an inverse. You should 
remember the formula 



a b 

c d 



-1 



d 

h 

£ 
h 



ad - be 0 . 



Exercises 2-4 



1. For each of the following matrices, determine whether the inverse exists; 
if it does exist, find it: 



(a) 
(b) 



1 1 

0 1 

1 1 
1 1 



(c) 



(d) 



-3 7] 
9 21 J • 

[ 1 ij ' 
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(e) 
(f) 



-2 0 - 2-6 

3 4j • '•^•'-13 

2 a" 

0 -7 



Each of the following matrices is actually a function in the sense that it 
depends on the value assigned to x, where x e R. Determine those values 
of X for which the matrix has no inverse. 



(a) 
(b) 



2 1 
X 1 



3 

X X 



(c) 
(d) 



x+2 
2 



0 

x-1 

3 



Show that each matrix of the form 



cos 6 sin 0 
—sin 0 cos 0 



has &n inverse and find it. Show that the product of two such matrices 
(different values of 0) is again such a matrix. (Hint: Use the addition 
formulas from trigonometry.) 

Show that if A C H then A has an inverse if and only if its transpose 
has an inveroe. If A has an inverse show that 

transpose (A ^) « (transpose A) ^. 

Prove Theorem 2—3 by first computing A ^ by Theorem 2-4 and then using 
Theorem 2—5 to compute the inverse of A ^. 

Under the assumption that the element A of H has an inverse, show how 
to solve the equation AX « B, with B a 2 x 1 matrix. Apply this to 
solve the following equations: 

(a) 2x + 3z 9, (c) 2y 3w = 0, 
-X + 42 = 10 ; -y + 4w = 0 ; 

(b) 3x + 2 = 0, (d) 3y + w = 1, 



-2x + 2 « 1 ; _ _ -2y + w = 0 . 
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2'-'5. The Determinant Function 

We have seen that the criterion for the existence of an inverse for the 
matrix 



a b 
c d 



involves the value of the expression ad — be . If ad — be 9* 0, the inverse 
does exist; if ad - be « 0, the inverse does not exist. Each 2x2 matrix 
determines one real value for ad be. For example, 



if A « 
if A = 
if A =^ 



1 0 
0 1 

2 3" 
4 6 



[0.5 



0.6 



lu^^n ad - be « 1(1) - 0(0) « 1; 
then ad - be « 2(6) - 3(4) « 0; 
then ad - be « 0.5(0.6) - 3(4) « -11.7. 



(Note that the second matrix does not have an inverse.) With each matrix of M 
there is thus associated one value, a real number determined by the entries. It 
is convenient to give a name to this number, the value of the expression 
ad — be, which is associated with the matrix 



a b 
c d 



Definition 2-4. If 



a b 
c d 



then &(X) » ad — be is called the determinant of X. 



Thus & assigns to each member X of M a real number 5(X) , read 
"delta of X." It is appropriate to regard this assignment or riiapping as a 
funct-f.on from the set H of 2X2 matrices 



a b 
c d 



to the set R of real numbers 
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We indicate this as follows: 



X = ad — be. 



a b 
c d 



-> ad — be, 



that is, 



5 : X > 5(X). 



The funetion 5 has interesting properties, some of whieh we shall 
demonstrate • 

First let us compute the values 6(X) for a few products: 
(a) If 



3 2 
1 2 



and B 



0 3 
2 1 



then 



6(A) = 3(2) - 2(1) = 4, 
6(B) = 0(1) - 3(2) = - 6, 



AB « 



3 


1 




"o 


3" 




'4 


11 ' 


1 


2 




2 


1 




4 


5 



(b) If 



5(AB) = 4(5) - 11(4) = - 24. 



-1 2 
0 3 



and B = 



8 2 
3 1 



then 



5(A) = - 1(3) - 2(0) = - 3, 
5(B) = 8(1) - 2(3) = 2, 
[sec. 2-5] 
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AB 



-1 2 

0 3 



8 2 
3 1 



9 3 



8(AB) = - 2(3) - 0(9) = 6. 

We might suspect that 6(AB) = 6(A) 8(B) '. This >^ true atjd we shall now ptov0 
it. 



Theorem 2-^6 . If A € M and B € M, theti 



Proof. Let 



6(AB) = 5(A) 5(B), 



A = 



a b 
c d 



B = 



P q 
r s 



Chen 



AB 



ap + br aq + bs 
cp + dr cq + ds 



5(AB) = (ap + br)(cq + ds) - (aq ts)(cp H« dr) 
= apcq + apds + brcq + brds 

— aqcp — aqdr — bscp bsdr 
= apds + brcq — aqdr — bscp, 

5(A) « ad - be, 
5(B) « ps - qr, 
5(A)5(B) = (ad - bc)(ps - qr) 

=s adps — adqr — bcps + bcqr* 

By rearranging the terms in expressions (1) and we see thac 

5(AB) « 5(A) 5(B). 

Let us look at more examples; let 
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3 2 
I 2 



and 



B = 



'0 3 
2 1 



In Section 2--4 learned that 



X = 



then 



-1 



1 

B(X) 



a -^,1 

aj 



Hence 



-1 
A ^ 



I 
2 
I 
4 



-1 



I i 

6 2 

i 0 
3 



Further, 



&(A) 3(2) - 2(1) ^ 4, 
5(B) = 0(1) - 3(2? ^ - 6, 

Theorem 2^ > If A is a ^X2 me^^^ix, and A has ^ multipUcative 
inverse, then 

^1 1 

> " &(aT • 

Proof. V/e have 

-1 t 

AA 



But by computi^S 6(1) > see that 
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5(1) = 1(1) - 0(0) 1, 



whence 



5(AA ^) = 1, 



so that by, Theorem 2-6, 



S(A) 5(A ^) = 1, 



or 



We shall now prove quite a significant theorem, which will g^^e the 
power to decide when a product aB has an inverse and v^hat the i^^^eirse is. 

Theorem 2-8. If A and B are 2/2 matrices, then aB ha5 an 
inverse if and only if A and b both ha^e inverses. Further, these 
ina trices have inverses, then 



Pi^oof . Since 



S(A) 6(B) = 5(AB), 



it follows that 



5(AB) ^ 0 



if and only if 



6(A) ^ 0 and &(B) ^ 0. 



Then by Theorems 2 A and 2.5 we see that AB has an inverse if ^nd only if 
A and b both have inverses. g 
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To complece the proof °f Theorem 2-3 ^ we assume that A and B have 
Inverses and B~^, respectively, and then have only to exhibit a matrix 

X such that 

. ABx » 1 = XAB. 



Let 



Then 



ABX =* ABb'^A ^ 

A(bb''^)A"^ 
" A(i)A~^ 

I. 



Hen^e b"^ A~^ is a right inverse. Similarly, we can show that 



A ^ AB = I. 



Thus b"^ is the inv^^Se of AB- This completes the proof, 

Jfof example, let 



0 I 
2 3 



and B = 



2 5 



3 I 
2 2" 

1 0 



and B 



-1 ^ 



3 

-1 2 



Nov 



AB ^ 



0 


l" 




2 


5*" 




^ 1 


3" 


2 


3 




1 


3 




7 


19 
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whence 



(AB) 



-1 



3 
19 



19 


3 


1 


2 


7 


1 


2 


2 
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But also, 



3 -5 
-1 2' 



3 


1 




2 


2 




1 


0 





19 

2 
7 
2 



Thus, for our example we have (AB) ^ ^ b"^ A ^. 

There are many other theorems that can be developed from the concept of a 
determinant function. A few of these will be included in the exercises that 
follow. It is worth noting, though we shall not prove it, that there is a 
determinant function associated with the other sets of square matrices, that is, 
with those of order 1, 3, 4,..., and that similar theorems hold for them. 
Specifically, there is a determinant function associated with each square matrix, 
and its nonvanishing is a necessary and sufficient condition for the matrix to 
have an inverse. 



Exercises 2—5 



1. Verify Theo rem 2—6 for the ma trices 

B = 



(a) A = 

(b) A = 

(c) A = 



2 ll 
3 



t-- 1 
-1 t 



2 -1 

X X 

x3 



, B = 



, B 



2 1 

4 3 



0 1 

1 0 



X —X 

3 4 



2. Show that 



for any A e M and any t e R 



5(tA) = t S(A) 
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3. For A and t as in Exercise 2, show that 5(A) is the constant term 
in the polynomial 6 (A tl) . 

4. If 



X 1 
x2 ^1 



and B = 



2 1 
-5 -2 



find 5(A) and 5(B AB) and show that they are equal. 
5. Show that if A e M, B e M, and B is invertible, then 



5(B ^AB) = 5(A). 



6. Show that if A e M and A^ is the transpose of A, then 



5(A) = 6(A ), 



and conclude that 



5(AA ) > 0 



for any A e M. 

7. The expression 5(A - tl) is a polynomial in t. For each of the follow- 
ing matrices A, expand this polynomial and find its zeros: 



(a) 

(b) 

(c) 
(d) 



1 2 

0 4 

-1 o' 

0 1 

t o' 

-t 1 

a 0 

0 b 



8. Let 



A - 



2 0 
-1 1 
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and o.xpand the polynomial B(AA^ — xl). Is this the same as the polynomial 
B(A^A — xl)? Are these two pblynomials the same for every matrix A e M? 



2— 6> The Group of Invertible Matrices 

In this chapter, we have been restricting our attention to the set M of 
2x2 matrices. This set is, itself, a subset of the set of all rectangular 
matrices. Now this set M can be separated into interesting subsets. In the 
preceding section, we have divided M into two complementary subsets, the set 
of 2x2 matrices that do not have inverses and the set of 2x2 matrices 
that do have inverses.' In this section, we shall confine out attention princi*- 
pally to the set of inveiT.ible 2x2 matrices. It is convenient to denote 
this set by the symbol . 

Let us summarize certain facts about the set of invertible matrices: 

(a) If A e M. , and B e M. , then AB € M. . 

(b) If A e M^, Be M^, ^ and C e M^, then A(BC) = (AB)C. 

(c) In M^, there is an identity element I, that is, an element 
I such that AI = A = lA for each A e M^, . 

(d) If A e M. , then A has an inverse A ^ € M . , that is, an 

^—1 —1 —1 

element A such that AA = I = A A. 

Not only does the set satisfy each of these conditions, but there are 
many subsets of that satisfy conditions analogous to them. Any set S of . 

matrices that satisfies conditions (a), (b), (c) , and (d), with S in place of 

will be called a group . The concept of a group is fundamental and extremely 
important in machematics. More generally, any set of elements A,B,C,..., not 
necessarily matrices, satisfying the foregoing properties relative to an opera- 
tion (not necessarily matrix multiplication) is defined to be a group. You will 
note that only one operation is involved in the group properties. Although we 
shall later introduce a few examples of the mort general concept, for the 
moment let us consider some examples of groups of invertible matrices. 

The smallest set of invertible matrices that constitutes a group is the set 
whose one elan en t is the unit matrix * Since (I) (I) »^ I, condition 

(a) is satisfied; and condition (b) is automatically fulfilled by any set of 
square matrices. Certainly I is a member of the set, so that condition (c) 
is satisfied. For condition (d), there must be an Inverse for every element; 
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this condition is satisfied since in our present set the only element I is its 
own inverse . 

All quite simple, isn't it? Was it obvious? 

Another set that constitutes a group is the set {I, -I}. Again conditions 
(b) and (c) obviou£.ly are satisfied. Since 

(I)(--I) = (-I)(I) = - I 

and 

(I) (I) = (-I)(-I) = I, 

conditions (a) and (d) also are satisfied. 

The third set that we shall show to be a group is a bit more significant. 
The set of all elements A e M such that 5(A) =1 is a group. The proof is 
a bit more difficult, and we must check carefully each one of the defining 
properties. To provide a language thai: will be helpful, let us denote this set 
by W, thus: 

W = {A: A e M and 5(A) = 1) . 



Let us verify first that condition (a) is satisfied. If A e W and S e W, 
then 6(A) => 1 and 5(B) = 1. Since 5(AB) 5(A) 5(B) by Theorem 2-6, we 
have 



5(AB) = 5(A) 6(B) - (1)(1) = 1, 



and thus AB e W. 

Property (b) holds automatically. 

For property (c) , since 5(1) =1, it is c3ear that I e W. 

To demonstrate that condition (d) is satisfied, we must show not* only that 
each element of W has an inverse but also that the inverse is an element of 
W. Now, if A € W, then 6(A) = 1. Since 6(A) 0, A has an inverse A , 
by Theorem 2-5. By Theorem 2-7, 

Hence A~ ^ e W, and we have now demonstrated that W is a group. 
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In our last example, we shall discuss all matrices of the form 



8? 



X y 

— y X 



(x,y = real numbers) 



and denote this set by Z, Z C M. (Read zCM as, "The set Z is contained 
in the set M.") 

We observe first that the product of any two members of this set Z is also 
a member of Z. We have, indeed. 



^1 


^1 




""2 


^2 


B3 


-^1 






'^2 


X2 





^1>^2 ^ ^1^2 



^1^2 " ^1^2 
-(x^yj^ + Xj^y^) -yi^y2 + ^Cj^x^ 

Condition (b) is automatically satisfied; and I is a member of Z, with 
X a» 1, y = 0, sc that condition (c) is satisfied. 

In considering condition (d), we run into trouble. The zero matrix is an 
element of this set, but the zero matrix does not have an inverse. The set of 
all matrici^s of the form 



X y 
-y X 



does not form a group , 

Although the set 2 does not satisfy the four conditions, a subset 
Z, defined by 



of 



Zj^ « (A: A € Z and 5(A) « 1), 

does satisfy the conditions and is therefore a group. 

The demonstration is eajy. Let A e Z^^ and B € 2^^. We know that 

AB € Z,i, as already shown; and, since 5(A) « 1 and 5(B) » 1, wc know that 

5(AB) " 1. Hence AB € Zj^, and therefore condition (a) is satisfied. Obviously, 

condition (b) also is satisfied. We know that I € Z and that 5(1) « 1; 

hence, I e Z, , so that condition (c) is satisfied. Finally, for condition (d), 

—1 —1 

we must show that if A 6 Z^ then there is an inverse A such that A 6 Z^. 
We follow the pattern set In an earlier illustration. Since 5(A) « 1, there is 



an inverse. Then, using the fact that 5(A ) » 1/5(A), we proceed to show 
that 5(A""^) " 1, which means that A*"^ e 1'^^ . Having demonstrated that the four 
groups postulates are satisfied, we conclude that we have a group. 
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Before considering the more general concept of a group, we shall demon- 
strate a fruitful correspondence between the elements of and the points on 
a unit circle, which will let us examine a geometric interpretation of . 

If 



X y 
^y 3c 



is any element of Zj^, we have 6(A) ^ 1; that is, we have 



X + y » i . 



Now, if we let x\and y be coordinates of a point (x,y), we are able to 
establish a one-to-one correspondence between the elements of and the 

points on a unit circle; 



X y 



(X, y). 



The set of matrices is thufs maoped onto the set of points in such a way that to 
each matrix there corresponds exactly one point of the set, and to each point of 
the set there corresponds exactly one matriJ^* 

The point (x, y) is on the circle of radius 1 with center at the origin, 
as shown in Figure 2-1 • 



(K, y) 




Figure 2-1. The unit circle* 



Let us call this circle the unit circle and denote it by q. 
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39. 



Thus 



Q {(x, y): X e yea, and -f y^ « l|. 
A geometrical meaning can be assigned to the inverse of any element of Z^. If 

then we can readily compute A*^^ by Theorem 2-5, to obtain 



,-1 



y X 



] 



Recalling the one-to-«one correspondence between the matrices of and Che 

points of Q (the unit circle), 



X y 
-y X 



(x> y), 



we see, by examining Figure 2-2, that the correspondent of A ^ is the reflec- 
tion in the x axis of the correspondent of A. 



A ^ 



(x, y) 




Figure 2-2. Geometric representation of inverse matrices A and A ^ € . 

Although a full discussion of the general notion of a group would be too 
extensive for this book, a few words are in order. The definition of an abstract 
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group is stated somewhat differently from the definition given on page 
85, although the abstract definition implies the latter. 

Definition 2-5 . A group is a set G of elements, a,b,c on which a 

binary operation o (read ''circle") is defined, such that the following 
properties are satisfied: 

(a) If a € G and b € G, then a o b € G. (Closure property.) 

(b) If a € G, b € G, and c € G, then 

a o (b o c) « (a o b) o c. (Associative property.) 

(c) There exists a unique element i, i € G, such that 

i o a » a = a o i for all a € G. (Identity property.) 

(d) For each a e G, there exists an element a ^, a ^ € G, 
such that a"""^ o a = i = a o a"*-*-. (Inverse property.) 

If, in addition, the following condition is fulfilled, the group is said to be 
commutative or abelian ; 

(e) For each a e G and each b€G, aob»boa. (Commutative 
property . ) 

Although the operations we are most concerned with in mathematics are 
addition and multiplication, we are not restricted to these in the foregoing 
abstract definition. For instance, a very helpful exercise, not only for under- 
standing the notion of a group but also for comprehending a finite number system, 
is the addition associated with a clock face; see Figure 2-3. This furnisher, 
us with a group. The set of elements is 1,2 12. The operation is clockwise 




Figure 2-3. A clock face. The addition associated with it gives us a group. 
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addition of hours. Each defining property of an abstract group is satisfied, as 
we shall now illustrate. First, the "sum" of any two elements is another 
element. For example, we have 

1+6=7, 
8 + 4 = 12, 
11 + 2a 1, 
3 + 12 = 3. 

Secondly sinc^, for example, 

(8 + 2) + 3 1 and 8 + (2 + 3) = 1, 

we see that the associative property holds. Thirdly, a full clock rotation, an 
advance of 12 hours, gives the same time, so that 12 is our unique identity 
element; thus, 

12 + 2 = 2 = 2 + 12. 

Finally, to each of the elements, 1,2,. ,.,12, there corresponds a number we can 
"add" to obtain 12. Thus 

4 + 8 = 12 = 8 + 4, 
10 + 2 = 12 = 2+10, 
12 + 12 = 12 « 12 +12. 

One of the most elegant examples of a group consists of the three cube 
roots of 1, namely 

1 - 1 + i -1 - i a/3 

2 2 

under multiplication. The demonstration is left to the student as an exercise. 

Interestingly enough, although the integers are the most commonly used 
system that has a group suructure (under the operation of addition), they were 
not the first to have their group structure analyzed. The first groups to be 
studied extensively were finite groups such as the two examples given above. 
These groups were found during a stud^ of tlVe theory of equations by Evariste 
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Galois (1811-1832), to whom is credited the origin of the systematic study of 
Group Theory. Unfortunately, Galois was killed in a dueL at the age of 21, 
immediately after recording some of his most notable theorems. 



Exercises 2—6 



1. Determine whether the following sets are groups under multiplication: 



(a) 



" 1 oi [-1 oi r 0 ri . 

0 ij ' [ 0 -ij ' [-1 oj ' 



(b) I, -I, K, -K, 



where 



2. Show that the set of all elements of M of the form 



t 0 
0 t 



, where t c R and t 0, 



constitutes a group under multiplication. 
3. Show that the set of all elements of M of the form 



where t e R, s e R, and t - s =1, 



constitutes a group under multiplication. 
4. If. 



A = 



1 V3 

2 2 
V3 1 

2 "2 



shov; that the set 



(A-, A^, A^} 
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is a group under multiplication. Plot the corresponding points in the 
plane. 

Let 

0 ll 

1 oj • 

Show that the set 

is a group under multiplication. Is this true if T is an^ invertible 
matrix? 

Show that the set of all elements of the form 

Q , with a 6 R, b e R, and ab « 1, 

is a group under multiplication. If you plot all of the points (a,b), 
with a and b as above, what sort of a curve do you get? 

Let 

and let H be the set of all matrices of the form 

xl + yK, with X e R and y € R. 

Prove the following: 

(a) The product of two elements of H is also an element of H. 

(b) The element xl + yK is invertible if and only if 

X — y tI* 0. 

2 2 

(c) The set of all elements xl + yK with x - y « 1 is a group 
under multiplication. 
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L -2 
6 1 



and K « 
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8. If a set G of 2x2 matrices is a group under multiplication, show that 
each of the following sets are groups under multiplication: 

(a) {A^: a € G], where A*" « transpose of A; 

(b) {b"~^ AB: A e G), where B is a fixed invertible element of M. 



9. If a set G of 2x2 m£ ' rices is a group under multiplication, show that 

-1 f 

(a) G = (A ^: A G G}, 

(b) G = {BA: a e G), where B is any fixed element of G. 

10. Using the definition of an abstract group, demonstrate whether or not each 
of the following sets under the indicated operation is a group: 

(a) the set of odd integers under addition; 

(b) the set R of positive real numbers under multiplication; 

(c) the set of the four fourth roots of 1, (1, —1, i, -i}, under 
multiplication; 

(d) the set of all integers of the form 3m, where m is an integer, 
under addition. 

11. By proper application of the four defining postulates of an abstract group, 
prove that if a, b, and c are elements in a group and a o b « a o c, 
then b =» c . 



2'-'7 . An Isomorphism between Complex Ntunbers and Matrices 

It is true that very many different kinds of algebraic systems can be 
expressed in terms of special collections of matrices . Many theorems of this 
nature have been proved in modem higher algebra. Without attempting, any such 
proof, we shall aim in the present section to demonstrate how the system of 
complex numbers can be expressed in terms of matrices. 

In the preceding section, several subsets of the set of all 2x2 matricefi 
were displayed. In particular, 7the "set Z of all matrices of the form 



^ ^ , X e R and y € R, 
-y xj * ' 

was considered. We shall exhibit a orier^to-one correspondence between the set 
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of all complex numbers, which we denote by C, and the set Z, This one-to-one 
correspondence would not be particularly significant if it did not preserve 
algebraic properties — that is, if the sum of two complex numbers did not 
correspond to the sum of the corresponding two matrices and the product of two 
complex numbers did not correspond to the product of the .:orresponding two 
matrices. There are other algebraic properties that are preserved in this 
sense. 

Usually a complex number is expressed in the form 

x+yi, 

where i » x e R, and y e R. Thus, if c is an element of C, the 

set of all complex numbers, we may write 

c = x(l) + y(i)- 

The numeral 1 is introduced in order to make the correspondence more apparent. 
In order to exhibit an element of Z in similar form, we must introduce the 
special matrix 



J « 



0 1 
-1 0 



Note that 



thus 



0 1 
-1 0 



0 1 
-1 0 



"1 0 
0 -1 



- 1 



1 0 
0 1 



- I; 



The matrix J corresponds to the number i, which satisfies the analogous 
equation 

This enables us to verify that J Q 
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which Indicates that any element of Z may be written In the form 



xl + yj. 



For example 9 we have 



[-3 2j • 



and 



0 5 
-5 0 



] 



Now we can establish a correspondence between C, the set of complex numbers, 
arid Z, the set of matrices: 

xl + yl xl + yj. 

Since each element of C Is matched with one element of Z, and each element 
of Z Is matched with one. element of C, we call the correspondence one-to-one 4 
Several special correspondences are notable: 
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0 = 0-1 + 0-1 0*1 + O.J = 0 

.1 a 1.1 +0.1 > 1*1 + O.J = 1 

\ 

1 = 0.1 + 1.1 <— ^ 0.1 + l.J « J 

As stated earlier, It Is Interesting that there Is a correspondence 
between the complex numbers and 2x2 matrices, but the correspondence Is not 
particularly significant unless the one-to-one matching is preserved In the 
operations, especially in addition and multiplication. We shall now follow the 
correspondence In these operations and demonstrate that the one-to-one property 
Is preserved under the operations. 

When two complex numbers are added, the real components are addftd, and the 
Imaginary components are added. Also, remember that the multiplication of a 
matrix by a number is distributive; thus, for a e R, b € R, and A e M, we 
have 

(a + b)A « aA + bA. 
Hence we are able to show our one-to-one correspondence under addition: 

12 .12 
« (Xj^ + lyj^ ) + (x^ + iy2) (^1^ + yi^) + (^z-"^ ^2^^ " 

« (xj^ + x^) + iy^ + y^)^ + ^2>'^ (yi ^2^^* 

For example, we have 

(2 - 3i) + (4 + 11) (21 - 3J) + (41 + IJ) 

= 6-21 > 61 - 2J. 

and 

(3 « 21) + (2 + 01) (31 - 2J) + (21 + OJ) 

= 5-21 <—> 51 - 2J. 

Before demonstrating that the correspondence Is preserved under multiplica- 
tion, let us review for a moment. An example will suffice: 
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(2 + 4i) (3 - 2i) « 6 - 4i + L2i - 8i 

» 6 - 4i + L2i 8(-l) 

« 6 + 8(1) + (--4 + L2)i 

« 14 + 8i; 



(21 + 4J)(3I - 2J) 



61^ - 4IJ + L2JI - 8J^ 



= 61 - 4J + 12 J - 8(-I) 
« 61 +81 + (-4 + 12)J 
« 141 + 8J. 



Generally, for multiplication, we have 



^1^2 

« (x^ + y^i) (x^ + y^^) 



x,x„ + yiyoi + x-y i + y,x i 



^1^2 



(x^I + y^J) + y^J) 

-2 ^ ,2 . ^ 
Xj^x^J. + y2^y2'^ + ^1^2 ^1^2 



w^^2 ^ ^1^2" ■ ^1-^2" ■ ^r2 



If we represent a complex number 



a + bi 



as a matrix, 



a + bi 



a b 
-b a 



we do have a significant correspondence'. Not only is there a one— to-one cor- 
respondence between the elements of the two sets, but also the correspondence 
is one— to-one under the operations of addition and multiplication. 

The additive and multiplicative identity elements arc, respectively, 



0 + Oi 



0 0 
-0 0 



and 
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and for the additive inverse of 



a + bi 



a b 
-b a 



we have 



- a - bi 



-a 
b 



-b 
—a 



Let us now examine how the multiplicative inverses, or reciprocals, can be 
matched. We have seen that any member of the set of 2x2 matrices has a 
multiplicative inverse if and only if for it the determinant function does not 
equal zero. Accordingly, if A e Z then there exists a" if and only If 
x^ + 0, since 5(A) » x^ + y^ for A - xl + yJ. Now we know that any 
com»,lex number has a multiplicative inverse, or reciprocal, if and only if the 
complex number is not zero. That is, if c - x + yi, then there exists a 
multiplicative inverse of o if and only if x + yi 0, which means that x 
and y are not both 0. This is equivalent to saying that x +y 5« 0, 
since x e R and y e R. For multiplicative inverses, if 



x^ + y^ 0, 



our correspondence yields 



■ X + yi 



xl + yJ - Z, 



- - -2 o (x - yl) 

1 X + y 



(XI - yJ) - Zl 

X + y 



rl 



It is now clear that the correspondence between C, the set of complex 
numbers I and Z, a subset of all 2x2 matrices. 



X + yl <— ^ xl + yJ, 
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is preserved under the algebraic operations. All of this may be summed up by 
saying that C and Z have identical algebraic structures. Another way of 
expressing this is to say that C and Z are isomorphic . This word is derived 
from two Greek words and means '*of the same form." Two number systems are 
isomorphic if, first, there is a mapping of one onto the other that is a one— to— 
one correspondence and, secondly, the mapping preserves sums and products. If 
two number systems are isomorphic, their structures are the same; it is only 
their terminology that is different. The world is heavy with examples of iso— 
morphisms, some of them trivial and some quite the opposite. One of the sxu^plc??^ 
is the isomorphism between the counting numbers and the positive integers, a 
subset of the integers; another is that between the real numbers and the subset 
a + Oi of all complex numbers. (We should quickly guess that there is an 
isomorphism between real numbers a and the set of all matrices of the form 



An example of an isomorphism that is more difficult to understand is that 
between real numbers and residue classes of polynomials. We won't try to explain 
this one; but there is one more fundamental concept that can be introduced here, 
as follows. 

We have stated that the real numbers are isomorphic to a subset of the cont- 
plex numbers. We speak of the algebra of the real numbers as being embedded in 
the algebra of complex numbers. In this sense, we can say that the algebra of 
complex numbers is embedded in the algebra of 2x2 matrices. Also, we can 
speak of the complex numbers as being' "rither" than the real numbers, or of the 
2x2 matrices as being richer than the complex numbers. The existence of 
complex numbers gives us solutions to equations such as 



which have no solution in the domain of real numbers. It is of course clear 

that Z is a proper subset of M, that is, ZC M and Z / M. Here is a simple 

example to illustrate the statement that M is "richer" than 2: The equation 



al + OJl) 



x^ -f- I « 0, 



- 1 « 0 



has for solution any matrix 



0 



t 
0 



t G R and t f< 0, 



1/t 
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as way be seen quickly by easy computation; and there are still other solutions. 
On the other hand, the equation 



X - 1 = 0 

has exactly two solutions among the complex numbers, namely c « 1 and 
c =« - 1. 



^ * on and 
,d matrices 



Exercises 2—7 

1. Using the following values, show the correspondence * 
multiplication between complex numbers of the form v' 
of the form xl + yJ: 

(a) Xj^ « 1, « - 1, x^ = 0, and y^ « - 2; 

(b) Xj^ = 3, yj^ » - 4, x^ = 1, and y^ « U 

(c) Xj^ « 0, yj^ = - 5, x^ » 3, and y^ ° 4. 

2. Carry through, in parallel columns as in the text, the necessary computa- 
tions to establish an isomorphism between R and the set 



by means of the correspondence 



X <— > 



X 0 
0 X 



3, In the preceding exercise, an isomorphism between R and the sets of 
matrices 



was considered. Define a function f by 



X 0 
0 0 
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Determine which of the following statements are correct: 

(a) f(x + y) » f(x) + i{y), 

(b) f(xy) - ffx) f(y), 

X. 0, 
" 1, 



(c) f(0) 
' (d) f(l) 

4. Is the Qet G of. matrices 



a bl 
2 2 

with a and b rational and a + b « 1, a group under multiplication? 



2-3 . Algebras 

The concepts of group, ring, and field are of frequent occurrence In modeim 
algebra. The study of these systems Is a study of the structures or patterns 
that are the framework on which algebraic operations are dependent* In this 
chapter, we have attempted to demonstrate how these same concepts describe the 
structure of the set of 2x2 matrices, which is a subset of the set of all 
rectangular matrices. 

Not only have we Introduced f>>«*se embr£:c:ing concepts* but we have exhibited 
the '^algebra" of the sets, "Algebra'* lit a asnerlc word that is frequently used 
in a loose sense • By technical deflnltlon^n algebra is a system that has tx^ 
binary operations, called "addition'* and ' multiplication! " and also has 
^'multiplication by a number,'* that make it: both a ring and a vector space. 

Vector spaces will be discussed in Chapter 4, and we shall gee then that 
the set of 2x2 matrices constitutes a vector space under matrix addition 
and multiplication by a number. Thus the 2^2 matrices form an algebra. 

As you yourself might conclude at this time, this algebra 1» only one of 
many possible algebras. Some of these algc^^ras are duplicates at Qne another 
in the sense that the basic structure of ot^it ts the same as the basic structure 
of anat^^r^ Superficially, they seem di&^vmt because of the CESCiinology, 
When th«V have the saae structure, two b^^c^b are called isomarphic* 



chapter 3 
MATRICES AND LINEAR SYSTEMS 



3-1. Equivalent Systems 

In this chapter, we shall demonstrate the use of matrices In the solution 
of systems of linear equations. We shall first analyze some of our present 
algebraic techniques for solving these systems, and then show how the same 
techniques can be duplicated In terms of matrix operations. 

Let us begin by looking at a system of three linear eciuntions: 

X - y + z -2, W 
1 X - 2y - 2z » -1, (2) 

2x + y + 3z - 1 . (3) 

In our first step toward finding the solution srt of Chia system, m proceed 
as follows: Multiply Equation (1) by 1 to oW,n Ee«a£±on (1'); imltlply 
Equation (1) by -1 and add the new equation to SquarliOT (2) to obtain Equation 
(2"); multiply Equation (1) by -2 and add the new atpss^^ to Equation (3) to 
obtain. Equation (3'). This gives the following B»3«s=a!t. 

X - y + z - -2, 
XI ; 0 - y -- 3z - 1, (2") 

0 + 3y + z ■ 5. (3') 

Before continuing, we note that what we have dwe 1* reversible. In fact, we 
can obtain System I from System II as follows: :mm^f Equation (1') by 1 
to obtain Equation (1); add Equation (1') to Equ*«oi^ to obtain Equation 

(2); multiply Equation (1") by 2 and add to EqiiiJtion v ') to obtain Equation (3). 

* Our second step Is similar to the first: K^teSi Equation (1") as Equation 
(1"); multiply Equation (2') by -1 to obtain Equate a") 5 multiply Equation 
(2') by 3 and add the new equation to Equation (3') zo obtain Equation (3"). 
This gives 113 



X - y + z - -2, (111) 

III 0 + y + 32 a -1, (2") 
0 + 0 - 8z « 8 . (3H) 

Our third step reverses the direction; Multiply Equation (3") by - 1/8 
to obtain Equation (3'"); multiply Equation (3'») by 3/8 and add to Equation 
(2") to obtain Equation (2'"); multiply Equation (3") by 1/8 and add to 
Equation (1") to obtain Equation (1"'), We thus get 

X - y + 0 « -1, (II" ) 

IV 0 + y + 0 « 2, (2»" ) 
0 + 0 + 2 « -1. (3m ) 

Now, by retaining the second and third equations, and adding the second equatic 
to the first, we obtain 

X + 0 + 0 « 1, 

V o+y+0»2, 
0 + 0 + 2 « -1, 

or, in a more familiar form, 

X » 1, 
y « 2, 
2 « -1, 

In the foregoing procedure, we obtain system II from system I, III from 
II, IV from III, and V from IV, Thus we know that any set of values that 
satisfies system I must also satisfy each succeeding system; in particular, 
from system V we find that any (x, y, 2) that satisfies I must be 

(1» 2, --1). 

Accordingly, there can be no other solution of the original system I; if there 

is a solution, then this is it. h -4 a 
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But do the values (X, 2, -L) actually satisfy system I? For our 
systems of linear equations we have already pointed out that system I can be 
obtained from system II; similarly, II can be obtained from III, III from IV, 
and IV from V. Thus the solution of V, namely (1, 2, -1), satisfies I. 

Of course, you could verify by direct substitution that (i, 2,-1) 
satisfies the system I, and actually you should do this to guard against com- 
putational error. But it is useful to note explicitly that the steps are 
reversible, so that the systems I, II, III, IV, and V are equivalent in 
accordance with the following definition: 

Definition 3-1 * Two systems linear equations are said to be equivalent 
if and only if each solution of either system is also a solution of the other. 

We know that the foregoing systems I through V are equivalent because the 
steps we have taken are reversible. In fact, the only operations we have per- 
formed have been of the following sorts; 

A. Multiply an equation by a nonzero number. 

B. Add one equation to another. 

Reversing the process, we unc the addition by subtraction and the multi- 
plication by division. 

Actually, there is another operation we shall sometimes perform in our 
systematic solution of systems of linear equations, and it also is reversible: 

C. Interchange two equations. 
Thus, in solving the system 

y + z = 4, 
X + 2y + z = 3, 
X - y + z =» 1, 

our first step would be to interchange the first two equations in order to have 
a leading coefficient differing from zero. 

In the present chapter we shall investigate an orderly method of elimina- 
tion, without regard to the particular values of the coefficients exceprt that we 
shall avoid division by 0 . Our mechod x>?ill be especially useful in dealing 
with several systems in which corr^onding coefficients of the variables are" 
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equal while the right-hand members are different — a situation that often 
occurs in industrial and applied scientific problems. 

You might use the procedure, for example, In "programming," i.e., devising 
a method, or program, for solving a system of linear equations by means of a 
modem electronic computing machine. 



Exexclses 3—1 
1. Solve the following systems of equations: 





3x + 4y w 


4, 






(b) 


X 


- 2y 


- 3, 








5c: + 7y « 


1; 










y 


- 2; 






(c) 


X + y — 


z «■ 


3, 




{d) 


X 


- 3y 


+ 2z 




6, 




2y + 


z = 


10, 








y 


— z 


■Si- 


- A. 




5x — y — 


2z " 


- 3; 










z 




7; 


(e) 


X + 2y + 


z — 


3w = 


2, 


(f) 


Ix 


+ Oy 


+ Oz 




Ow » a, 




y - 


2z - 


W " 


7, 




Ox 


+ ly 


+ Oz 




Ow w b. 






z — 


2w «= 


0, 




Ox 


+ Oy 


+ Iz 




Ow ■» c. 








w «• 


3; 




Ox 


+ Oy 


+ Oz 




Iw w d , 



Solve by drawing graphs: 
(a) 



X + y 
X - y 



2, 

2; 



(b) 3x - y » 11, 
5x + 7y - 1. 



3. Perform the following matrix multiplications: 



(a) 



10 0 
0 10 
0 0 1 



(b) 



10 0 
0 10 
0 0 1 



a X 
b y 
c z 



A. Given the following systems A and B, obtain B from A by steps consisting 
either of multiplying an equation by a nonzero constant or of adding an 
arbitrary multiple of an equation to another equation: 



-xi" 2, 
y « 3, 

-1; 



2x - 3y + z « - 6, 
B: X + 2y - 2 - 9, 

3x + y + 3z « 6. 
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Are the two systems equivalent? Why or why not? 

5. The solution set of one of the following systems of linear equations is 
empty, while the other solution set contains an infinite number of 
solutions. See if you can determine which is which, and give three 
particular numerical solutions for the system that does have solutions: 

(a) X + 2y - z 3, (b) x + 2y - z = 3. 

X - y z = 4, X - y + z = 4, 

4x y + 2z = 14; 4x - y + 2z = 15. 



3—2. Formulation in Terms of Matrices 

In applying our method to the solution of the original system of Section 
3—1, namely to 

x - y + z=-2, 
X - 2y - 2z = - 1, 
2x + y + 3z = 1, 

we carried out just two types of algebraic operations in obtaining an equivalent 
sys tem: 

A. Multiplication of an equation by a number other than 0. 

B. Addition of an equation to another equation. 

We noted that a third type of operation is sometimes required, namely: 

C. Interchange of two equations. 

This third operation is needed if a coefficient by which we otherwise would 
divide is 0, and there is a subsequent equation in which the same variable 
has a nonzero coefficient. 

The three foregoing operations can, in effect, be carried out through 
matrix operations. Before we demonstrate this, we shall sec how the matrix 
notation and operations developed in Chapter 1 can be used to write a system 
of linear equations in matrix form and to represent the steps in the solution. 

Let us again consider the system we worked with in Section 3-1: 
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X - y + =^ - 2 , 
X - 2y - 2z « - 1, 
2x4- y + 3z = T . 

We may display the detached coefficients of x, y, anu us a matrix A, 
namely 



1 -1 1 

1 -2 -2 

2 13 



Mexty let U8 consider the matrices 



X = 



and 



-2 
-1 
1 



the entries of X are the variables x, y, z, and of B are the right-hand 
members of the equations we are considering. By the definition of matrix 
multiplication, we have 





1 


-1 


1 




X 




X 


- y 


+ 


z 




AX « 


1 


-2 


-2 




y 




X 


- 2y 




2z 






2 


1 


3 




z 




2x 


+ y 


+ 


32 





which is a 3X1 matrix having as entries the leftr-hand members of the 
equations of our linear system. 
Now the equation 



AX B 



(1) 



that is, 



1 -1 

1 -2 

2 1 



is equivalent, by the definiticn of equality of matrices, to the entire system 
of linear equations. It is an achievement not to be taken modestly that we are 
able to consider and work with a large system of equations in terms of such a 
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simple representation as is exhibited in Equation (1). Can you see the pattern 
that is emerging? 

In passing, let us note that there is an interesting way of viewing the 
matrix equation 



AX « Y, 



(2) 



where A is a given 3x3 matrix and X and Y are variable 3x1 column 
matrices. We recall that equations such as 

y = ax + b 



and 



y = sin X 



define functions, where numbers x of the domain are mapped into numbers y of 
the range. We may also consider Equation (2) as defining a function, but in this 
case the domain consists of column matrices X and the range consists of column 
matrices Y; thus we have a matrix function of a matrix variable! For example, 
the matrix equation 



1 


-1 


1 




X 




X 


- y 


+ z 




u 


1 


-2 


-2 




y 




X 


- 2y 


- 2z 


S3 


V 


2 


1 


3 




z 




2x 


+ y 


+ 3z 




w 



defines a function with a domain of 3x1 matrices 



X 

y 

z 



(3) 



and a range of 3X1 matrices 



u 




X — 


y 


+ 


z 




V 




X — 


2y 




2z 


• 


w 

L J 




2x + 


y 


+ 


3z 





Thus with any column matrix of the form (3), the equation associates a 
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column matrix of the form (4). 

Looking again at the equation 



AX « B, 



where A is a given 3x3 matrix, B is a given 3x1 matrix, and X is 
a variable 3x1 matrix, we note that here we have an inverse question: What 
matrices X (if any) are mapped onto the particular B? For the case we have 
been considering, 



1 -1 1 

1 -2 -2 

2 13 



-2 
-1 
1 



we found in Section 3—1 that the unique solution is 



1 
2 
-1 



We shall consider some geometric aspects of the matrix— function point of 
view in Chapters and 5. 

We are now ready to look again at the procedure of Section 3—1, and restate 
each system in terms of matrices: 



1-11 

1 -2 -2 

2 13 



1 -1 
0 -1 
0 3 



1-11 
0 13 
0 0-8 



1 -1 

0 1 

0 0 

1 0 
0 1 
0 0 





X 




-2 




y 


m 


1 




z 




5 




X 




-2 




y 


m 


-1 




z 




8 




X 




-1 




y 




2 




z 




-1 




X 




1 




y 


n 


2 




z 




-1 



The two— headed arrows ^ ^ indicate that, as we saw in Section 3—1, the matrix 
equations are equivalent. 

In order to see a little more clearly what has been done, let us look only 
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at the first and last equations. The first Is 



1-11 

1 -2 -2 

2 13 



--2 
-1 
1 



the last Is 



1 0 
0 1 
0 0 



1 

2 
-1 



Our process has converted the coefficient matrix A Into the Identity matrix 
I. Recall that 



A « I. 



Thus to bring about this change we must have completed operations equivalent 
to multiplying on the left by A ^. 

In brief, our procedure amounts to multiplying each member of 



AX « B 



on the left by A ^, 



a"^AX » A 



to obtain 



X « A ^B. 



Let us note how far we have come. By Introducing arrays of numbers as 
objects In their own right, and by defining suitable algebraic operations i we 
are able to write complicated systems of equations in the simple form 

AX « B. 

This condensed notation, so similar to the long-familiar 
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ax a b, 

which we learned to solve "by division," indicates for us our solution process; 
namely, multiply both sides on the left by the reciprocal or inverse of A, 
obtaining the solution 

X =« A~^B, 

This similarity between 

ax b 

and 

AX = B 

should not be pushed too far, however. There is only one real number, 0, that 
has no reciprocal; as we already know, there are many matrices that have no 
multiplicative inverses. Nevertheless, we have succeeded in our aim, which is 
perhaps the general aim of mathematics: to make the complicated simple by 
discovering its pattern. 



Exe r c i s es_3r-2 



1. Write in matrix form: 



(a) 4x - 2y + 7z =2, 
3x + y + 5z = -1, 
6y — z = 3; 



(b) X + y « 2, 
X - y = 2 . 



2. Determine the systems of algebraic equations to which the following matrix 
equations are equivalent; 



(a) 


3 


4 


5 




X 




1 


(b) 


3 


2 


-2 




X 


u 




1 


2 




1 


2 


3 




y 




0 


> 


2 


-1 






y 


V 


s 


2 


3 




0 


1 


2 




z 




2 




-1 


1 


5 




z 


w 




3 


1 



3. Solve the following system of equations; then restate your work in matrix 
form: j^22 
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x+y+z— w«l, 
X - y + 3?. + 2w a 2, 
2x + y + 3z + w « --2, 
X - 2y + z 4- 3w « 10 . 

4. (a) Onto what vector Y does the function defined by 

map the vector [^j « [^j vector ^ j does it map onto the 

vector Y = [^J ? 

5. Let A j^aj^ a2 a^ aj , Y » [yj , a^, x^, y^^ e r. Discuss the 
domain of the function defined by 

AX :« Y. 

Define the inverse function, if any. 



3—3, Inverse of a Matrix 

In Section 3—2 we wrote a system of linear equations in matrix form, 

AX « B, 

and saw that solving the system amounts to determining the Inverse matrix A ^, 
If It exists, since then we have 

A^^AX e B, 

whence 

X =» a'^^b- 

Our work Involved a series of algebraic operations; let us learn how to duplicate 
this work by a series of matrix alterations. To do this, we suppress the column 
matrices In the scheme shown In Section 3-2 and look at the coefficient matrices 
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on the left: 



1-11 




1-11 




1 -1 1 




1 --1 0 


<=> 


10 0 


I -2 -2 




0 -1 -3 




0 13 


<=> 


0 10 


0 10 


2 13 


0 3 1 




0 0-^ 




0 0 1 




0 0 1 



Observe what happens if we substitute "row" for "equation" In the procedure of 
Section 3-1 and repeat our steps. In the first step, we multiply row 1 by —1 
and add the result to row 2; multiply row 1 by —2 and add to row 3. Second, we 
multiply row 2 by 3 and add to row 3; multiply row 2 by -1. Third, we multiply 
row 3 by^ 3/8 and add to row 2; multiply row 3 by 1/8 and add to row 1; 
multiply row 3 by - 1/8. Last, we add row 2 to row 1., Through "row operations" 
we have duplicated the changes In the coefficient matrix as we proceed through 
the fineries of equivalent systems. 

The three algebraic operations described In Section 3—2 are paralleled by 
three matrix row operations ; 

Definition 3-2 . The three row operations, 
Interchange of any two rows, 

Multiplication of all elements of any row by a nonzero constant. 
Addition of an arbitrary multiple of any row to any other row, 

are called elementary row operations on a matrix. 

In Section 3— 5> the exact relationship between row operations and the 
operations developed In Chapter 1 will be demonstrated. Earlier we defined 
equivalent systems of linear equations; In a corresponding way, we now define 
equivalent matrices • 

Definition 3-3 . Two matrices are said to be row equivalent If and only If 
each can be transformed Into the other by means of elementary row operations. 

We now turn our attention to the right— hand member of the equation 

AX = B. 

At the moment, the right— hand member consists solely of the matrix B, which 
we wish temporarily to suppress just as we temporarily suppressed the matrix 
X In considering the left-hand member. Accordingly, we need a coefficient 
matrix for the right-hand member. To obtain this, we use the Identity matrix 
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to write the equivfllent equation 



AX = IB. 



(1) 



Now our process converts this to 



X = A ^B, 



which can be expressed equivalently as 



IX 



A ^B, 



(2) 



When we compare Equation (I) and Equation (2) we notice that as A goes to I, 
I goes to A ^. Might it be that the row operations, which convert A into I 
when applied to the left-hand member, will convert I into a"^^ when applied 
'^Vto the right— hand member? Let us try. For convenience, we do this in parallel 
columns, giving an indication in parentheses of how each new row is obtained: 



I -I I 

1 -2 -2 

2 13 



(Rj^; -IR^; 3R2 + R3): 



I -I I 
0 -I -3 
0 3 1 



I -I I 

0 13 
0 0-8 



1 3 I 

( g. R3 + Rj^; 8 S ^2' 8 ^3^' 



I 

0 I 

0 0 



0 0 
0 
I 



I 

0 



I -1 

-5 3 



0 0 



0 0 
0 



(IR^ + Rj^ ; R2» ' 



I 


-I 


0 






0 


I 


0 






0 


0 


2 










J 





3 
8 
7 
8 
5 
8 



3 
8 
1 
8 
3 
8 
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10 0 
0 10 
0 0 1 





4 


4 




8 


8 




1 


3 




8 


8 


SB 


3 


1 


8 


8 



To deti»:>t^i ate that 



2 

5 





4 




8 




3 




8 


3 


1 


3 


8 



is a li^jft^and inverse for A, it is necca^ary to show that 



BA 



4 


4 


4 


[00 


8 


8 


7 


1 


3 


8 


8 


8 


5 


3 


1 


8 


8 


8 



1-11 

1 -2 -2 

2 13 



= I 



You are asked to verify this as an exercise, and also to check that AB = 1] 
thus demonstrating that B is the inverse A ^ of A. In Section 3-5 we shall 
see why it is that AB = I follows from BA = I for matrices B that are 
obtained in thi^ way as products of elementary matrices. 

We now have the following rule for computing the inverse A ^ of a matrix 
A, if there is one: Find a series of elementary row operations that convert 



A into the identity matrix I; the same series of elementary row operations 
will convert the identity matrix I into the inverse A ^. 

When we start the process we may not know if the inverse exists. This 
need not concern us at the moment. If the application of the rule is successful, 
that is, if A is converted into I, then we know that A ^ exists. In sub- 
sequent sections, we shall discuss what happens when A ^ does not exist and we 
shall also demonstrate that the row operations can be brought about by means of 
the matrix operations that were developed in Chapter 1. 
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1. Determine the inverse of each of the fdilcx>^:- >^^mzr=tQ:S through :=dw 
cperations. (Check your answers.) 



(a) 



(c) 



2 3 
-1 2 



10 3 
2 10 
-2 0 0 



(b) 



(d) 



4i. 



^1 



n 1 

I 'i 
0 i 



2. Determine the inverse, if any, of each of tht rtwlng matrices: 



(a) 



(c) 



(e) 



2 2 
-1 -Jl 



10 3 
2 3 4 
13 1 



1 4 7 

2 3 6 
5 1-1 



(b) 



(d) 



3. Solve each of the following matrix equations: 

(b) 



(d) 



(a) 


1 


4 


7 




X 




3 




2 


3 


6 




y 


83 


0 




5 


1 


-1 




z 




7 


(c) 


1 


4 


7 




X 




1 




2 


3 


6 




y 




3 




5 


1 


-1 




z 




--4 



i ' 



C ! 
1 J 



1 




7 




X 




3 




2 




o 




y 




0 




5 


1 






z 




-2 




























r- 








1 


4 






X 




D 




2 


3 






y 


o 


D 




5 


\ 






Z 









Solve 



1 -1 1 

1 -2 -2 

2 13 



X u m r 
y V n s 
z w p t 



1 -6 2 



-1 

3 



1 0 



5. Perform the multiplications 
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(a) 



3 2-2 
2-1-4 
-115 



1 


12 


10 


17 


17 


17 


6 


13 


8 


lt" 


17 


^ 17 




5 


7 


W 


17 


17 



(b) 



1 


12 


10 


17 


17 


17 


6 


13 


8 


17 


17 


17 


1 


5 


7 


17 


17 


17 



3 2 



2 1-^ 



-1 1 



6. Multiply both members of the matrix equation 



1 12 10 
6-13-8 
-15 7 



X 




17 


y 




0 


z 




0 



on the left by 



3 2 -2 
2 -1 -4 
-115 



and use the result to solve the equation. 
Solve: 

2x + y + 2z - 3w « 0, 
4x+ y+ z+ wel5, 
5x-. y-- z — w»5, 
4x - 2y + 3z - w « 2 • 



8. Solvft: 



9x - y ^ 37, 
8y - 2z « --4, 
7z - 3w ^ -17, 
2x + 6w « 14. 
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.3rA. Linear Systems of Equatiomi 

In Sections 3—2 and 3-3, a i?roce.cj:re for solving a system inear sqtattli^ns , 

« B, 

was presented. The method produces t±sr mul.tirlicative inverse if it 

exists . 

In the present section we shall c::=siderr systems of linear equations tXL 
general . 

Let us begin by looking at a simp^a illustration. 

iExample 1 . Consider thc» system cz equations, 

2x - 3y + 42 « 5, 
2x + 7y - 2z « 1, 
2x + 2y + z = 3. 

We start in parallel columns, thus: 



2-3 4 
2 7-2 
2 2 1 



10 0 
0 10 
0 0 1 



Proceeding, we arrive afterr three steps at the following: 



1 0 
0 1 

0 0 



11 

10 

3 

5 



20 

10 
1 

2 



20 ^ 



If we multiply these txjo matrices on the ri^t by the matrines 



and 



resp€ 



we obtain Che system 
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"'To ^ " 10 ' 

3 2 
y - 5 ^ - 5 ' 

0 = 0. 

There is no matsismatical .ass in dropptag the equation 0=0 irom the system, 
which then can ~e written aquivalenriy as 




Whatever value:±s given to z, th-is value and the corresponisag values of x 
and y determined by these equations ssaisfy tiie original syssam. For example, 
a few solutions are shown in the fcllowing table;: 



z 


X 


7 




41 


8 


-2 


10 


5 




4 


1 


I 


'5 


5 




19 


2 


0 


10 


5 




3 i 


4 ' 




10 


5 



Example 2. Now consider "riaes systems 



cr — + ^ ^ 4, 
te:— 37 + 2z 14. 



If we start:, in paral^ 
verify) \ 



CO 



1 0 
0 1 

0 0 



lumnff ssd OTDceed bedfore^ we cteain (as you should 
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i -i 0 

-1 -3 1 



IZL 

Multiplying these matrices on the right by the column matrices 



X 

y 

z 



and 



3 
4 
14 



ve obtain the systems 




0 = - 1. 



If itkere were a solution of the original system, it would also be a solution of 
this JIast system, which ±s equivalent tx) the first. But the latts^ system cort— 
tains the contradiction 0 = — 1; hence there is no solution of e±idier system. 

3o you have an intuitive geometric notion of what mighr. be goli ig on in 
sach of the above systems? Relative to a 3— dimensional rectsngulsr coordinate 
system, each of the equations represents a plane. Each psir of tfesse planes 
actually intersect in a line-. We might hope that the thrse lines irf intersectixm 
(in, each system) would intersect in a point. In the firsr system, however, the 
three::.lines are coincident; there is an entire '*line" of ^srlutions. On the 
other hand, in the second system, the three Lines ajre pac^J-el; there is no 
point that lies on all three planes. In the example worke: ^^it: in Sections 
3—2 and 3—3, the three lines intersect in a single point. 

How many possible configurations, as regaxds intersecrf.otrjH^ can you list 
for 3 planes, not necessarily distinct frcx one another? t^tifiy- :rrlght, 'i^v example, 
have exactly one point in common; or two might be co5 ^ cider one: tte t±±rd 
distinct from but paxallel to theroj and so on. ZTh^Eir^e sre sy ^tens ci£ linear 
fequa±ii3ns that corrss:pond to each of these: geometric: i :^itua£x-X)ns , 

^ISsze are two additional systems that even more cfrvviausiy than the system 
inrExsmple 2, above, have no solutions: 

X = 2, X -r y +:z = 2, 

x=3; x+y + 2:=3. 

ThuS'Vou see that the number of variables as cw^rqpared ^ribth the number of equations 

does not determine whether or not thece: iis: sislttKt:ian . 
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The method we have developed for solving linear systems is routine and can 
be applied to systems of any number of linear equations in any number of vari- 
ables. Let us examine the general case and see what can happen. Suppose «e 
have a system of m linear equations in n variables; 

a^x^ + a2X2 + ••• + a^x^ = 



Vl +'°2^2 + Vn = ^'m 

If any variable occurs with coefficient 0 in every equat::::on, we drop it. 2: 
a coefficient needed as a divisor is zero, we interchange anr equations; we miaht 
even interchange some of the terms in all the equations =^ the convenience or 
getting a nonzero coefficient in leading position. Othervriaa we can proceed in 
the customary manner until there remain no lequations in imich any of the vari- 
ables have nonzero coefficients. Eventually we have a systea like this: 



linear terms in variables other than \ ~ ^1' 

^2 ^ 



and (possibly) other equations in which no vaxiable appears^ 

Either all of these other equations (if there are any) .^tn^ of tha f ox:r 

0 « 0, 

in which case they can be disregarded, or there is at Least one of them ax the 
form 

0 a b , h Xl^r 

in which case there is a contradiction. 

If there is a contradiction, the contradiction-was preseitt ±xl the..cr±ginal 
system, which means there is no solution set. 

1 3 



In case there are no equatioas of the form 



.123 



b, 



b 9^ 0, 



we have two possibilities. Either there are no variables other than x 
which means that the system reduces to 



1 \ 



and has a unique solution, or there really are variables other than Xj^,,,.,Xj^ 
to which we can assign arbitrary values and obtain a family of solutions as in 
Example above. 



Exercises 3-A 

1. (a) List all possible configurations, as regards intersections, for 3 

distinct planes. 

(b) List also the additional possible configurations if the planes are 
allowed to be coxncident. " * ' 

2. Find the solutions, if any, of each of the following systems of equations: 



(a) X + y + 2z =» 1, 
2x + 2 = 3, 

3x + 2y + 4z « 4; 



(c) X — 2y + z = 1, 
2x + y - z a 1, 
X + 2y + 2z a 2; 



(b) X + y + z = 6, 
X + y + 2z « 7, 
y + z « 1; 



(d) 2v + X + y + z « 0, 

V— x + 2y+ z«0, 

4v - X + 5y + 3z « 1, 

V — x + y — za2; 
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(e) 2x+ y + z+w«2, 

x + 2y+ z — w = -l> 

4x + 5y + 3z - w » 0 • 



3— 5> Elementary Row Operations 

In Section 3-L, three types of algebraic operations were listed as being 
involved in the solution of a linear system of equations; in Section 3-4, three 
types of row operations were used when we duplicated the procedure employing 
matrices. In this section we shall show how matrix multiplication underlies 
this earlier work, — in fact, how this basic operation can duplicate the row. 
operations . 

Let us start by looking at what happens if we perform any one of the three 
row operations on the identity matrix I of order 3. First if we multiply a 
row of I by a nonzero number n, we have a matrix of the form 



n 


0 


0 




1 


0 


0 




1 


0 


0 


0 


1 


0 


t 


0 


n 


0 


, or 


0 


1 


0 


0 


0 


1 




0 


0 


1 




0 


0 


n 



Let J represent any matrix of this type. Second, if we add one row of I to 
another, we obtain 



1 


"0 


0 




1 


0 


0 




1 


1 


0 


1 


i 


0 


> 


0 


1 


0 


9 


0 


1 


0 


0 


0 


1 




1 


0 


1 




0 


0 


1 



or one of three other similar matrices. (What are they?) Let K stand for 
any matrix of this type. Third, if we interchange two rows, we form matrices 
(denoted by L) of the form 



0 10 
10 0 
0 0 1 



10 0 
0 0 1 
0 10 



0 0 1 
0 10 
10 0 



Matrices of these three types are called elementary matrices . 



Definition 3-^ . An elementary matrix is any square matrix obtained by 
performing a single elementary row operation on the identity matrix. 
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Each elementary matrix E (that is, each J, K, or L) has an inverse, 
that is, a matrix E ^ such that~~ 



E^E=I = EE^. 



For example, the inverses of the elementary matrices 





1 


0 


0 




1 


0 


0 




0 


1 


0 


J = 


0 


n 


0 


, K = 


0 


1 


1 


, and L = 


1 


0 


0 




0 


0 


1 




0 


0 


1 




0 


0 


1 



are the elementary matrices 





1 


0 


0 




1 


0 


0 




0 


1 

n 


0 




0 


1 


-1 




0 


0 


1^ 




0 


0 


1 



and L 



-1 



0 10 
10 0 
0 0 1 



respectively, as you can verify by performing the multiplications involved. 

An elementary matrix is related to its inverse in a very simple way. For 
example, J was obtained by multiplying a row of the identity matrix by n; 
J^^ is formed by dividing the same row of the identity matrix by -n-. In a 
sense "undoes" whatever was done to obtain J from the identity matrix; 



conversely, J will undo J 



-1 



Hence 



J = I = Jj""^, 



The product of two elementary matrices also has an inverse, as the follow- 
ing theorem indicates • 

Theorem 3—1 . If square matrices A and B of order n have inverses 
A^^ and B ^, then AB has an inverse (AB) ^, namely 

(AB)"^ = b"^ a"^. 

Proof. We have 



and 



(AB) (B ^A ^) =^ A(B B ^)A ^ = A A ^ = I 
(b'^a"^) (AB) = b'~^(a'~^A)B = B~^B = I, 
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so that B ^ is the inverse of AB by the definition of inverse. 

You will recall that, for 2x2 matrices, this same proof was used in 
establishing Theorem 2»8. 

Corollary 3^1—1 * If square matrices A,B,..»,K of order n have inverses 
A""^, b""^, . . . ,k"^, then the r;roduct AB'-'K has an inverse (AB*-*IC) , namely 

—1 —1 ^1 —1 
(AB***K) ^ = K - •B . 

The proof by mathematical induction ±3 l&fX as an exercise. 

Corollary 3-1-2 . If E^,E2,...,E^ sre eiementaxy matrices of order n, 
then the matrix 



has an inverse, namely 



This follows immediately from Coroli^i^F snd the fact that each 

elementary matrix has an inverse. 

The primary importance of an elemetrrary snatrix xests on the following 
property. If an m X n matrix A is raixltipiied on the left by an m X n 
elementary matrix E, then the product ±s itsie matrix obtained from A by 
the row operation by which the elementary inaitrri3i was formed from I. For 
example , 



10 0 
0 n 0 
0 0 1 

IOC 
Oil 
0 0 1 



a b c 
d e f 
g h ± 



m b :c 
u£. ne irf 
is: h i 



a 

c + e 

e 



b 

d + f 

f 



You should verify these and other similar ^iieft multiplications by elementary 
matrices to familiarize yumrself with the paititems . 
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Theorem 3—2 > Any elementary row operation can be performed on an m x n 
matrix A by multiplying A on the left by the corresponding elementary matrix 
of order m. 

The proof is left as an exercise. 

Multiplications by elementary matrices can be combined. For example, to 
add —1 times the first row to the second row, we would multiply A on the 
left by the product of elementary matrices of the type J ^KJ: 



-10 0 
0 10 
0 0 1 



10 0 
110 
0 0 1 



-10 0 
0 10 
0 0 1 



10 0 
-110 
0 0 1 



Note that J multiplies the first row by -1; it is necessary to multiply by 
j"*^ in order to change the first row back again to its original form after 
adding the first row to the second. Similarly, to add —2 times the first rov 
to the third, we would multiply A on the left by 



^00 
0 10 
0 0 1 



10 0 
0 10 
10 1 



-2 0 0 
0 10 
0 0 1 



10 0 
0 10 
-2 0 1 



To perform both of the above steps at the same time, we would multiply A on 
the left by 



10 0 
-110 
0 0 1 



10 0 
0 10 
-2 0 1 



10 0 
-1 1 0 

-:<: 0 \ 



Since is the product of two matrices that are themselves products of 

elementary matrices, M^^ is the product of elementary matrices. By Corollary 
3-1—1^ the inverse of is the product, in reverse order, of the corresj-onding 

inverses of the elementary matrices. 

Now our first step in the solution of the system of linear equations at 
the start of this chapter corresponds precisely to multiplying on the left by 
the aVove matrix Mj^. If we multiply both sides of System I on page 103 on the 
left by M^. 
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10 0 
-1 1 0 
-2 0 1 



1-11 

1 -2 -2 

2 13 



10 0 
-110 
-2 0 1 



~2 
~1 
1 



we obtain System II. For the second, third, and fourth steps, the corresponding 
matrix multipliers must be respectively 



Mo 



10 0 
0 --1 0 
0 3 1 



1 0 
0 1 
0 0 



110 
0 10 
0 0 1 



Thus multiplying on the left by has the effect of leaving the first row 

unaltered, multiplying the second row by -1, and adding 3 times the second 
row to the third. 

Let us now take advantage of the associative law for the multiplication of 
matrices to form the product 



110 
0 10 
0 0 1 



1 0 
0 1 
0 0 



10 0 
0-10 
0 3 1 



10 0 
-110 
-2 0 1 



A 


A 


A 


8 


8 


8 


7 


1 


3 


8 


8 


8 


5 


3 


1 


8 


^ 8 


8 



We recognize the inverse of the original coefficient matrix, as determined in 
Section 3--3, 

Theorem 3-3 > If 

BA « I, 

where B is a product of elementary matrices, then 

AB « I, 

so that B is the inverse of A. 



Proof. By Corollary 3-1—2, B has an inverse b"* . Now from 

138 

BA - I 

(aeo. >5] 



129 



we get 



whence 



A « b"^^, 



so that 



and 



AB = B ^B = I 



Exercises 3—5 



1. Find matrices A, B, and C such that 
(a) 



(b) 



(c) 



2 3 
0 2 
-1 -1 



3 4 
0 -2 
-I I 



2 3 
0 2 
0 -I 1 



10 0 
0 10 
0 0 1 



2. Express each of the following matrices as a product of elementary matrices; 



(a) 



0 


0 


-1 


0 


3 


1 


8 


8, 
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(b) 



10 0 
0-10 
0 0 1 



(c) 



1-11 

0 10 
0 0 1 



3. Using your answers to Exercise 2, form the inverse of each of the given 
matrices • 

4. Find three 4x4 matrices, one each of the types J, K, and L, that 
will accomplish elementary row transformations when applied as a left 
multiplier to a 4x2 matrix. 

5. Solve the following system of equations by means of elementary matrices: 

^: - y - 2z a 3, 
y + 3z « 5, 
2x + 2y - 3z « 15. 

6. (a) Find the inverse of the matrix 



1 0 1 

0-12 

2 0 1 



(b) Express the inverse a product of elementary matrices, ' 

(c) Do you think the answer to Exercise 6(b) is unique? Why or why not? 
Compare, in class, your answer with the answers found by other members 
of your class. 



7* Give a proof of Corollary 3-1—1 by mathematical induction. 
8. Perform each of the following multiplications: 



(a) 



(b) 



(c) 



e 
h 

b 
e 
h 

b 
e 
h 



0 
1 
0 

0 
1 
0 

0 
2 
1 
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State a general conjecture you can make on the basis of your solution of 
Exercise 8 . ' 



3--6 , Suinmary 

In this chapter, we have discussed the role of matrices In finding the 
solution set of a system of linear equations. We began by looking at the 
familiar algebraic methods of obtaining such solutions and then learned to 
duplicate this procedure using matrix notation and row operations. The Intimate 
connection between row operations and the fundamental operation of matrix 
multiplication was demonstrated; any elementary operation Is the result of left 
multiplication by an elementary matrix. We can obtain the solution set. If It 
exists, to a system of linear equations either by algebraic methods, or by row 
operations, or by multiplication by elementary matrices. Each of these three 
procedures Involves steps that are reversible, a condition that assures the 
equivalence of the systems. 

Our work with systems of linear equations led us tCL...a method for producing 
the Inverse of a matrix when it exists. The identical sequence of row operations 
that converts a matrix A into the identity matrix will convert the identity 
matrix into the inverse of A, namely A ^. The inverse is particularly helpful 
when we need to solve many systems of linear equations, each possessing the 
same coefficient matrix A but different rightHiand column matrices B. 

Since the matrix procedure ' diagonalized' the coefficient matrix, the 
method is often called the "diagonalization method." Although we have not 
previously mentioned it, there is an alternative method for solving a linear 
system that is often more useful when dealing with a single system. In this 
alternative procedure, we apply elementary matrices to reduce the system to the 
form 



1 


a 


b 




X 




^=1 


0 


1 


c 




y 






0 


0 


1 




z 




."3. 



as in System III in Section 3—1, from which the value for z can readily be 
obtained. This value of z can then be substituted in the next to last equation 
to determine the value of y, and so on. An examination of the coefficient 
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matrix displayed above shows clearly why this procedure is called the "».i*i— 
angularization method." 

On the other hand, the diagonalization method can be speeded up to bypass 
the triangularized matrix of coefficients in (1), or in System III of Section 
3-1, altogether. Thus, after pi voting on the coefficient of x in the System 

" r\ 

I, 



X- y + <5 = -2, 
I X - 2y - 2z = - 1, 

2x + y + 3z « 1, 

to obtain the system 





X - y 


+ z = - 2, 


II 


0 - y 


- 3z = 1, 




0 + 3y 


+ z = 5, 



we could next pivot completely on the coefficient of y to obtain the system 



x + 0+43 = -3, 
III' 0 + y + 32:=-l, 

0 + 0 - &z = 8, 



and then on the coefficient of z to get the system 



X + 0 + 0 = 1, 
IV' 0 + y + 0 = 2, 

0 + 0 + z=-l, 



which you will recognize as the system V of Section 3—1. 

It should now be plain that the routine diagonalization and triangulariza— 
tion methods can be applied to systems of any number of equations in any number 
of variables and that the methods can be adapted to machine computations. 
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Chapter 4 

REPRESENTATION OF COLUMN MATRICES AS GEOMETRIC VECTORS 



4-1 > The Algebra of Vectors 

In the present chapter, we shall develop a simple geometric representation 
for a special class of matrices — namely, the set of column matrices 
with two entries each. The familiar algebraic operations on this set of matrices 
will be reviewed and also given geometric interpretation, which will lead to a 
deeper understanding of the meaning and implications of the algebraic concepts. 

By definition, a column vector of order 2 is a 2x1 matrix. Consequent- 
ly, using the rules of Chapter 1, we can add two such vectors or multiply any one 
of them by a number. The set of column vectors of order 2 has, in fact, an 
algebraic structure with properties that were largeily explored in our study of 
the rules of operation with matrices. 

In the following pair of tisaorems , we summarize wharr we already know con- 
cerning the algebra of these vecr;ors, and in the next section wd' shall begin the 
interpretation of that algebra ^jbi geometric terms . 

Theorem 4-1 . Let V and W be column vectors of order 2, let r be a 
number, and let A be a square matrix of order 2. Then 



V + W, rV, and AV 



are each column vectors of order 2, 
For example, if 



V 



r " 4, and A 



0 1 
3 -2 



then 



V + W 



, rV » 4 



'1 ■ • 



and 



AV 



' 0 






l" 




CM 


3 


-2 




2^ 




-1 
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Theorem 4-2 , Let V, W, and U be column vectors of order 2, let r 
and s be numbers, and let A and B be square matrices of order 2. Then 
all the following laws are valid. 

I. Laws for the addition of veptors: 

(a) V + W « W + V, 

(b) (V + W) + U - V + (W + U), 

(c) V + 0 « V, 

(d) y + (-V) 0 . • 

II. Laws for the numerical multiplication of vectors: 

;a) r(V + W) « rV -r rU, 

(b) r(sV) - (rs)V, 

(c) (r + a)V « rV + s^T, 

(d) OV - 0, 

(e) IV « V, 

(f) rO - 0. 

III. Laws for the multiplication of vectors by matrices: 

(a) A(V + W) » AV + AW, 

(b) (A -I B)V « AV + BV, 

(c) A(BV) - (AB)V, 

(d) O^V « 0, 

(e) IV « V, 

(f) A(rV) « (rA)V = r(AV) . 

In Theorem 4—2, 0 denotes the column vector of order 2, and 0^ the ' 
square matrix of order 2% all of whose entries are 0. 

Both of the preceding theorems have already been proved for matrices. Since 
column vectors are merely special types of matrices, the theorems as stated must 
likewise be true. They would also be true, of course, if 2 were replaced by 
3 or by a general n, throughout, with the understanding that a column vector 
of order n is a matrix of order n 1. . . 
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Let 



V7 « 



-2 
1 



and U 



2 



let 



r » 2 and s = - 1; 



and let 



3 0 
2 -1 



and B 



1 1 
-2 1 



Verify each of the laws stated in Theorem A^2 for this choice of values 
for the variables . 

Determine the vector V such that AV - AW » AW + BW, where 



4]- [,]■ "i-il 

Determine the vector V such that 2V + 2W » AV + BV, if 

i\. .»a B. [-; -]. 



Find V, if 



Let 



Evaluate . 
(a) A 



[J] 



and 



(b) A . 
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(c) Using your answers to parts (a) and (b), determine tha entries of 
A if, for every vector V of order 2, 

AV = 0 . 

(d) State your result as a theorem. 

6. Kestate the theorem obtained in Exercise 5 if A is a square matrix of 
order n and V stands for any column vector of order n. Prove the new 
theorem for n = 3. Try to prove the theorem for all n. 

7. Using your answers to parts (a) and (b) of Exercise 5, determine the entries 
of A if, for every vector V of order 2, 

AV « V. 

State your result as a theorem. 

8. Restate the theorem obtained in Exercise 7 if A is a square matrix of order 
n and V stands for any column vector of order n. Prove this theorem for 
n B 3. Try to prove the theorem for all n. 

9. Theorems 4—1 and 4—2 summarize the properties of the algebra of column 
vectors with 2 entries. State two analogous theorems summarizing: the 
properties of the algebra of row vectors with 2 eirtries . Show that the 
two algebraic structures are isomorphic. 



4—2. Vectors and Their Geometric Representation 

The notion of a vector occurs frequently in the physical sciences, where at 
vector is often referred to as a quantity having both length and direction and 
accordingly is represented by an arrow. Thus force, velocity, and even dis- 
placement are vector quantities. 

Confining our attention to the coordinate plane, let us investigate this 
intuitive notion of vector and see how these physical or geometric vectors are 
related to the algebraic column and row vectors of Section 4-1. 

An arrow in the plane is determined when the coordinates of its tail and 
the coordinates of its head are given. Thus the arrow Aj^ from (1,2) to (5,4) 
is shown in Figure 4-1. Such an arrow, in a given position, is called a located 
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if 




(2,5) 



(-2,3) 



2-(-2)=4 



(1,2) 




i 4~-2=2 



(5,4) 



Figure 4-1. i^^rrows in the plane. 



vector ; its tall is called the initial point , and its head the terminal point 
(or end point ), of the located vector. A second located vector with 
initial point (—2,3) and terminal point (2,5), is also shovn in Figure 4-1. 

A located vector A may be described briefly by giving first the co- 
ordinates of its initial point and then the coordinates of its terminal point. 
Thus the located vectors in Figure 4-1 are 



A^: (1,2)(5,4) 



and 



A^: (-2,3)(2,5). 



The horizontal run, or x component , of A, is 



5 1 « 4, 



and its vertical rise, or y component, is 



4 2 « .2. 



Accordingly, by the Pythagorean theorem, its length is 
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-J (5-1)^ + (4-2)^ = ^ 1^ = 2/5 . 

Its direction is determined by the angle 0 that it makes xijith the positive 
X direction: 

4 2 y/j , „ 2 >/5 . 

cos 0 = = , sin 0 = = — ^ . 

275 5 2/5 ^ 

Since sin 0 is the cosine of the angle that makes with the y direction, 

cos 0 and sin 0 are called the direction cosines of Aj^ . • 
You might wonder why we did not write simp7y 

tan 0 = I = i 

instead of the equations for cos 0 and sin 9. The reason is that while the' 
value of tan 8 determines slope , it does not determine the direction of Aj^ . 
Thus if 0 is the angle for the located vector (5, 4) (1,2) opposite to Aj^, then 

—2 1 
tan 0 312^ = 2" « tan 0; 

but the angles 0 and 9 from the positive x direction cannot be equal since 
they terminate in directions differing by n. 

Now the X and y components of the second located vector A2: (—2,3) 
(2,5) in Figure 4—1 are, respectively, 

2 - (-2) a 4 and 5 - 3 « 2, 

so that Aj^ and A2 have equal x components and equal y components; con- 
sequently, thery have the same length and the same direction. They are not in 
the same position, so of course they are not the same located vectors; but since 
in dealing with vectors we are especially interested in length and direction we 
say that they are equivalent. 



Definition 4—1 , Two located vectors are said to be equivalent if and only 
if they have the same length and the same direction. 

For any prescribed point P in the plane, there is a located vector 
equivalent to (and to A2) ^'^^ having P as initial point. To determine 
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the coordinates of the terminal point, you have only to add the components of 

to the corresponding coordinates of P. Thus for the initial point 
P: (3,-7), the terminal point is 

(3 + 4, -7 + 2) = (7, -5), 
so that the located vector is 

A3: (3, -7)(7, ~5). 

You might plot the initial point and the terminal point of in order to 

check that it actually is equivalent to Aj^ and to A2. 

In general, we denote the located vector A with initial point (^L^y^) 
and terminal point (3^2*^2^ 

A: (x^,y^)(X2,y2). 
Its X and y components are, respectively, 

^2 ^1 ^2 ^1* 

Its length is 

r = ^/ (X2 - x^)^ + (y2 - y^)^ . 

If r 0, then its direction is determined by the angle that it makes with 
the X axis: 

^2 ^1 ^2 ^1 

cos 0 a , sin G = . 

r r 

If r « 0, we find it convenient to say that the vector is directed in 
any direction we please. As you will see, this makes it possible to state 
several theorems more simply, without having to mention special cases. For 
example, this null vector is both parallel and perpendicular to every direction 
in the plane I 

A second located vector 14:9 
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is equivalent to A if and only if it has the same length and the same direction 
as A, or, what amounts to the same thing, if and only if it has the same com- 
ponents as A: 

- = - x^, y4 ^ ^3 y2 ^1 • 

For any given point (x^jy^), the Located vectsor 

B: (xo,yo)(xQ 4- - x^,yQ 4- y^ y^) 

is equivalent to A and has (xQ,yQ) as its initial point. 

It "hus appears that the located vector A is determined except for its 
position by its components 

a « x^ - Xj^ and b = y^ - y^^. 

These can be considered as the entries of a column vector 




In this way, any located vector A determines a column vector V. Conversely, 
for any given point P, the entries of any column vector V can be considered 
as the components of a located vector A with P as initial point. The 
locater vector A is said to represent V. 

A column vector is a "free vector" in the sense that it determines the 
components (and therefore the magnitude and direction), but not the position, 
of any located vector that represents it. In particular, we shall assign to 
the column vector 




a s ta ndard rep res en ta tion 

OP : (0,0)(u,v) 
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as the located vectcr from the origin to the point 



P : (u,v), 



as illustrated in Figure 4—2; this is the representation to which we shall 
ordinarily refer unless otherwise stated* 



Similarly, of course, the components of the located vector A can be 
considered as the entries of a row vector. For the present, however, we shall 
consider only column vectors and the corresponding geometric vectors; in this 
chapter, the term "vector" will ordinarily be used to mean "column vector," 
not "row vector" or "located vector." 

The length of the located vector OP, to which we have previously referred 
is called the leng th or norm of the column vector 



P:(-3,4) 




Figure 4—2. Representations of the column vector 
located vectors OP and QR. 



-3 
4 



as 



V = 



u 



V 



Using the symbol I I VI I to stand for the norm of V, we have 
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Thusy if u and v are not both zero, the direction cosines of OP are 



u V 
IIVIl ^"""^ IIVII ' 

respectively; these are also called the direction cosines of the column vector 
V. 

The association between column or row vectors and directed line segments, 
introduced in this section, is as applicable to 3— dimensional space as it is to 
the 2— dimensional plane. The only difference is that the components of a 
located vector in 3-dimensional space will be the entries of a column or row 
vector of order 3, not a column or row vector of order 2. 

In the rest of this chapter and in Chapter 5, you will see how Theorems 
4—1 and 4—2 can be interpreter^ through geometric operations on located vectors 
and how the algebra of matrices leads to beautiful geometric results. 



Exercises 4—2 



1. Of the following pairs of vectors, 

(a) J , 3 ; (f) 

(b) J , ; (g) 

/V ii r 0 1 

(c) , - /2 5 

(d) ll] , (i) 

(e) hi. K^l; (j) 





- 5 






-12 


9 


r ^ 


1 




•3v/2 1' 




• 8 




15. 


> 




1 






5 


> . 




'3' 






4 


> 



3]^ 



16 
30 

-4 



3t 
4t 



which have t;b>s same length? Which have the sair.e direction? 



2. Let V « f 



Draw arrows from the origin representing V for 



t « 1, t « 2, t « 3, t - 1, t « - 2, and t = - 3. 
In each case, compute the length and direction cosines of V. 
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In a rectangular coordinate plane, draw the standard representation for 
<%ach of the following sets of vectors. Use a different coordinate plane 
for each set of vectors. Find the length and direction cosines of each 
vector: 



(a) 
(b) 
(c) 
(d) 
(e) 



1 

0 

1 

0 

i' 

0 

2 
3 

5 



and 



and 



and 



and 



and 



1 

0 

1 

0 

1 

0 

2 
3 

5 
-4 



Let V 





X 




' i' 




0 






- t 












m 







Draw the line segments OP representing V if t = 0, + 1, + 2, and ^ 

(a) m = 1, b = 0; 

(b) m « 2, b = 1; 

(c) m = - 1/2, b = 3. 

In each case, verify that the corresponding set of five points (x,y) lies 
on a line. 

Two column vectors are called parallel provided their standard geometric 
representations lie on the same line through the origin. If A and B 
are nonzero parallel column vectors, determine the two possible relation- 
ships between the direction cosines of A and the direction cosines of 
B. 



Determine all the vectors of the form 

(d) 



(a) 
(b) 
(c) 



that are parallel to 



-1 

3 



(e) 



9 
-5 

0 
3 







1 






10^ 
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4-3. Geometric Interpretation of the Multiplication of a Vector by a Number 

The geometrical significance of the multiplication of a vector by a number 
is readily guessed on comparing the geometrical representations of the vectors 
V, 2V, and -2V for 

-hi- 

By definition, 

- = [1] • 

while 




Thus, as you can see in Figures 4-3 and 4-4, the standard representations of 
V and 2V have the same direction, while -2V is represented by an arrow 
pointing in the opposite direction. The length of the arrow associated with V 




Figure 4-3. The product of Figure 4-4 • The product of 

a vector and a positive number. a vector and a negative number. 

is 5, while those for 2V and -2V each have length 10. Thus, multiplying 
V by 2 produced a stretching of the associated geometric vector to twice its 
original length while leaving its direction unchanged. Multiplication by -2 
not only doubled the length of the arrow but also reversed its direction. 

[see. 4-3] 



154 



These observations lead us to formulate the following theorem. 



1*5 



Theorem 4-3 . Let the directed line segment PQ represent the vector V 
and let r be a number. Then the vector rV is represented by a directed line 
having length Irl times the length of PQ. If r > 0, the repre- 
sentation of rV has the same direction as PQ; if r< 0, the direction of 
the representation of rV is opposite to that of PQ, 



Proof. Let V be the vector 



. Then 



Now, 



/ 2 ^ 2 
V u + V 



rV = 



ru 

rv 



hence, 



IrVM = -J Cru)^ + (rv)' 

= V r (u 4- V ) 

2 

. « I r I V ^ + V 



This proves the first part of the theorem. 
If 



r « 0 or V 



the second part of the theorem is certainly true. 
If 



r 0 and V 9^ 



the direction cosines of PQ are 
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Ik6 



ivi 



and 



IVI I 



v^hlle those of rV are 



Irl IIVI! 



and 



rv 



I IVI 



If r > 0, we have |r| = r, whence it follows that the arrows associated with 
V and rV have the same direction cosines and, therefore, the same direction. 
If r < 0, we have Irl = — r, and the direction cosines of the arrow associ- 
ated with rV are the negatives of those of PQ. Thus, the direction of the 
representation of rV is opposite to that of PQ. This completes the proof of 
- the theorem. 

One way of stating part of the theorem just proved is to say that if r is 
a number and V is a vector, then V and rV are parallel vectors (see 
Exercise 4—2-6); thus they can be represented by arrows lying on the same line 
through the origin. On the other hand, if the arrows representing two vectors 
are parallel, it is easy to show that you can always express one of the vectors 
as the product of the other vector by a suitably chosen number. Thus, by check- 
ing direction cosines, it is easy to verify that 



5 




" 50" 


, and 


-10 


-2 




-20 




4^ 



are parallel vectors, and that 



50 
-20 



= 10 



5 
-2 



while 



"-lo' 


= - 2 


5' 


4 




-2 



In the exercises that follow, you will be asked to show why the general result 
illustrated by this example holds true. 



Exercises 4—3 



1. Let L be the set of all vectors parallel to the vector 



Fill in 



the following blanks so as to produce in each case a true statement: 
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(a) 
(b) 
(c) 



(d) 



[-1/3] c . 



6 



(e) for every real number t. 



(g) for every real nunber h ^ 0, 



(f) for every real number t 
h 



€ L 



i L. 



2. Verify gi.a): hi caXly and prove algebraically that the vectors in each of the 
follou'i itr; pairj? sci: parallel. In each case, express the first vector as 
the ;xoJ»;'.t die second vector by a number: 



(b) 
(c) 



I 3 



1]^ 

M ■ 



4 
-5 



(d) 
(e) 
(f) 



[-32] ■ 

[4]. 
[0]. 



1 

-16 

4 

2 
9 



3. Let V bfc a vector and W a nonzero vector such that V a;Ad W are 
parallel. Prove that there exists a real number r such that 



V = rW. 



4. Prove: 



(a) If rV « 

(b) If rV = 



and r 9* 0, then V = 



and V 



, then r « 0 • 



5. Show that the vector V + rV has the same direction as V if r > — 1, 
and the opposite direction to V if r < — 1. Show also that 



1 IV + rVI I =« I IVI I II + rl . 



4^. Geometrical Interpretation of the Addition of Two Vectors 



If we have two vectors 



V and W, V = 
[sec. 



and W 



their sum 
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is, by definition, 



V + w = 



[•] 



c 




a + c 


d 


S3 


b + d 



To interpret the addition geotnetricalLy, Let us return momentarily to the con- 
cept of a "free" vector. Previously we have associated a column vector 



with some located vector 



such that c « - and d « y^ - y^. In particular, we can associate W 
with the vector 



A: (a,b)(a + c, b + d) . 



We use A to represent W and use the standard representation for V and 
V + W in Figure 4-5 • 



(a+c, b-Kl) 




Figure 4-5. Vector addition. 
Also, we can represent V as the located vector 

B: (c,d)(a + c, b + d) 
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and obtain an alCemaClve representation* If the two possibilities are drawn 
on one set of coordinate axes, we have a parallelogram In which the diagonal 
represents the sum; see Figure 4—6. 

▲ 

' (a+c, b+d) 



— ^• 




Figure 4—6. Parallelogram rule for addition. 

The parallelogram rule is often used in physics to obtain the resultant 
when two forces are acting from a single point. 

Let us consider now the sum of three vectors 



" = [f] ' ^= [b]' [d] 



We choose the three located vectors 

OP : (0,0)(a,b) 

PQ : (a,b)(a + c, b + d) 

and 

QR : (a + c, b + d)(a + c+e, bi-d-f-f). 
to represent V, W, and u respectively; see Figure 4—7 . 
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0 



Figure 4—7. The sum V + W + U. 

Order of addition does not affect the sum although it does affect the 
geometric representation, as indicated in Figure 4-^. 




Figure 4-8. The sum V + U + W. 

If V and W are parallel, the construction of the proposed representative 
of V + W is made in the same manner. The details will be left to the student. 

Theorem 4^ . If the vectors V and W are represented by the directed 
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line segments OP and PQ, respectively, then V + W is represented by OQ. 

Since V - y « V H- the operation of subtracting one vector from 

another offers no essentially new geometric idea, once the construction of -W 
is understood. Figure 4-9 illustrates the construction of the geometric vector 
representing V — W, It is useful to note, however, that since 

y2 2 
(u - r) + (v - s) , 

the length of the vector V - W equals the distance between the points 
P: (u, v) and T: (r, s) . 



I IV - Wl I » 



u - r 

V - s 




Figure 4-9. The subtraction of vectors, V - W. 



Exercises 4-4 



1, Determine graphically the sum and difference of the following pairs of 
vectors. Does order matter in constructing the sum? the difference? 



(a) 
(b) 
(c) 



7 
1 

-5 

2 



(e) 

(f) 
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-1 
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-2 
-3 

1 

0 
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2. IHuscrate graphically the associative law: 



(V + W) + U « V + (W + U) . 

3. Compute each of the following graphically: 

« [i\ * [-^ Ml] > 

[4] * [i] ^ ■ 

- h] ^ [;] ^ [3]^ - [3] • 

4. State the geometric significance of the following equations: 
(a) V + 



(b) V + W + U = , 



[0] 



(c) V + W + U + T 
5. Complete the proof of both parts of Theorem 4-4. 



4-'5* The Inner Product of Two Vectors 

Thus far in our development, we have investigated a geometrical interpre- 
tation for the algebra of vectors. We have represented colunn vectors of order 
2 as arrows in the plane, and have' established a one— to-one correspondence be- 
tween this set of column vectors and the set of directed line segments from the 
origin of a coordinate plane. The algebraic operations of addition of two 
vectors and of multiplication of a vector by a number have acquired geometrical 
significance . 

But we can also reverse our point of view and see that the geometry of 
vectors can lead us to the consideration of additional algebraic structure. 

For instance, if you look at the pair of arrows drawn in Figure 4-10, you 
may comment that they appear to be mutually perpendicular. You have begun to 
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w 




> X 



Figure 4~L0 . Perpendicular vectors • 



talk about the angle between the pair of arrows. Since our vectors are located 
vectors, the following definition is needed. 

Definition 4^2 . The angle between two vectors is the angle between the 
standard geometric representations of tUe vectors. 

Let us suppose, in general, that the points P, with coordinates (a,b), 
and R, with coordinates (c,d), are the terminal points of two geometric 
vectors with initial points at the origin* Consider the angle POR, which we 
denote by the Greek letter 0, in the triangle POR of Figure 4-11. 

We can compute the cosine of 0 by applying the law of cosines to the 
triangle POR. If iOPi, lORi, and IPRI are the lengths of the sides of 
the triangle, then by the law of cosines we have 



2I0PI lOR! cos e lOPI^ + lORI^ - IPRI^. 



A ... 




P:(a, b) 



R:(c, d) 



Figure 4—11. The angle between two vectors. 
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But 



lORl 



IPRI « -J (a-c)^ + (b-d)^ . 



Thus, 



2( V^a^ + b^) ( V^c^ + d^) cos e = (a^ + b^) + (c^ + d^) - { (a-^)^ + (b-d)^} 

= 2(ac + bd). 



Hence % 



lOPl lOR! cos 0 = ac + bd, 



(1) 



The number on the rightHiand side of this equation, although clearly a function 
of the two vectors, has not heretofore appeared explicitly. Let us give it a 
name and introduce, thereby, a new binary operation for vectors. 



Definition 4-3. The inner product of the vectors 



and 



1 

J , written 



Y\ ' [a] . 



is the algebraic sum of the products of corresponding entries. Sjrmbolically , 



ac + bd, 



We can similarly define the inner product of two row vectors: [ a • [c d] 
ac + bd. 

Another name for the inner product of two vectors is the "dot product" of 
the vectors. You notice that the inner product of a pair of vectors is simply a 
number. In Chapter 1, you met the product of a row vector by a column vector. 



say 



times 



, and found that 



[ab][^] = [ac+bd] , 
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the product being a 1x1 matrix . As you can observe, these two kinds of 
products are closely related; for, if V and W are the respective vectors 



[b] ^"'^ [d] • 



we have V = 



and 



« l^ac + bdj « [v • w] 



Later we shall exploit this close connection between the two products in order 
to deduce the algebraic properties of the inner product from the known prop?-ties 
of the matrix product. 

Using the notion of the inner product and the formula (1) obtained above, 
we can state another theorem. We shall speak of the cosine of the angle included 
between two column (or row) vectors, although we realize that we are actually 
referring to an angle between associated directed line segments. 

Theorem 4-"5 . The inner product of two vectors equals the product of the 
lengths of the vectors by the cosine of their included angle. Symbolically, 

V «» W « I IVI I I IWI I cos 0, 

where 9 ij che between the vectors V and W, 

Theorem 4—5 has been proved in the case in which V and W are not 
parallel vectors. If we agree to take the measure of the angle between two 
parallel vectors to be 0° or 180° according as the vectors have the same or 
opposite directions, the result still holds. Indeed, as you may recall, the law 
of cosines on which the burden of the proof rests remains valid even when the 
three vertices of the "triangle" POR are collinear (Figures 4^12 and 4-13), 





Figure 4-12. Collinear vectors 
in the same direction* 



Figure 4^13, Collinear vectors 
in opposite directions. 
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Corollary 4-5-1* The relationship 



V • V « 1 IV! 1^ 



holds for every vector V. 

The corollary follows at once from Theorem 4-5 by taking V « W, in which ' 

case e « 0°. To be sure, the result also follows immediately from the facts 
that, for any vector V « , we have 



y • V « a^ + b^, while 1 IVl 1 « V + • 

Two column vectors V and W are said to be orthogonal if the arrows 
OP. and OR representing them are perpendicular to each other. In particular, 
the null vector j^^j is orthogonal to every vector. Since 

cos 90° « cos 270° « 0> 
we have the following result: 

Corollary 4-5-2 . The vectors V and W are orthogonal if and only if 

V • W " 0. 

You will note that the condition V • W » 0 is automatically satisfied 
if either V or W is the null vector. 

We have examined eome of the geometrical facets of the inner product of two 
vectors, but let us now look at some of its algebraic properties. Does it sat- 
isfy commutative, associative, or other algebraic laws we have met in studying 
number systems? 

We can show that the commutative law holds, that is, 

V • W « W • V. 

For if V and W are any pairs of 2x1 matrices, a computation shows that 

V^W » W^V, 
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But 




while W^V « 



[w • v] • 



Hence 



V • W « W • V, 



It is equally possible to show that the associative law cannot hold for inner 
products. Indeed, the products V • (W • U) and (V • W) • U are meaningless. 
To evaluate V • (W • U), for exiample, you are asked to find phe inner product 
of the vector V with the number W • U. But the inner product is defined for 
two row vectors or two column vectors and not for a vector and a number. Inci- 
dentally, the product V(W • U) should not be confused with the meaningless 

V • (W • U). The former product has meaning, for it is the product of the vector 

V by the number W • U. 

In the exercises that follow?, you will be asked to consider some of the 
other possible properties of the inner product. In particular, you will be asked 
to establish the following theorem, the first part of which was proved above. 

Theorem 4^ . If V, W, and U are column vectors of order 2, and r 
is a real number, then 



(a) V • W - W • V, 

(b) (rV) • W - r(V • W), 

(c) V • (W + U) « V • W + V • U, 

(d) V • V > 0; and 

(e) if V • V « 0, then V « 



1. Compute the cosine of the angle between the two vectors in each of the 
following pairs: 




Exercises 4^5 



(b) 



(a) 
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(e) 
(f) 



[1]. ['I' <»> [3]' 



In which cases^ if any, are the vectors orthogonal? In which cases, if any, 
are the vectors parallel? 



Let 



and " 



[?]• 



Show that, for every nonzero vector V, 



V a Ej 

I IVI I 



V • E, 



and 



are the direction cosines of V. 

(a) Prove that two vectors V and W are parallel if and only if 

V • W « + 1 IVI I 1 IWI I . 

Explain the significance of the sign of the right-hand side of this equation 

(b) Prove that 

(V • W)^ < IIVII^ IIWII^ 

and write this inequality in terms of the entries of V and W. 

(c) Show also that V • W < I IVI I I IWI 1 . 
Show that if V is the null vector then 

V • W - 0. 

Fill in the blanks in the following statements so as to make the resulting 
sentences true: 

(a) The vectors 
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-10 






are 









are parallel* 



(b) The vectors 

(c) The vectors 

(d) The vectors 



[3 

4 
-3 



and 
and 
and 



-] 
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are orthogonal. 



3 
12 



are 



are parallel, 



(e) For every positive real number t, the vectors 



and 



are orthogonal. 



(f) For every negative real number t, the vectors 

2 



3t 
2t 



and 



are orthogonal . 



6. Verify that parts (a) - (d) of Theorem 4-6 are true if 



" - [1] ' " " [-3] ' ^ - h] ' 



and r a 4. 



7 . Prove Theorem 4-6 

(a) by using the definition of the inner product of two vectors; 

(b) by using the fact that the matrix product satisfies the 



equation 



V^W » [ V • w] 



8. Prove that | |V + Wl 1^ » (V + W) • (V + W) » I IVI 1^ + 2 V • W + I IWI l"^ for 
every pair of vectors V and W. 

9. Show that, in each of the following sets of vectors, V and W are 
orthogonal, V and T arc parallel, and T and W are orthogonal: 

(.) V. T. [t], „. 

Cb) V. [^]. [^J]. „. [-3]. 

Do the same relationships hold for the set 

'-[o]. -[f]. "-[.1' 

[see; 4-5] 
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10. Let V be a nonzero vector. Suppose that W and V are orthogonal, while 
T and V are parallel. Show that V; and T are then orthogonal. 

11. Show that, for every set of real numbers r, s, and n, the vectors 

and ^ 1^ r j orthogonal. 

12. Let V » ["J * where V is not the zero vector. Show that if W and V 
are orthogonal, there exists a real number t such that 

13. Show that the vectors V and W are orthogonal if and only if 

1 IV + Wl 1^ - 1 IV - Wl 1^ « 0. 

14. Show that if A = |^^| and B = j^^j , then 

1 lAl 1^ I IBl 1^ - (A • B)^ = (ad - bc)^. 

15. Show that the vectors V and W are orthogonal if and only if 

(V + W) • (V + W) = V • V + w • w. 

16. Show that the equation 

(V + W) • (V - W) « V • V - W • W 

holds for all vectors V and W. 

17. Show that the inequality 

1 IV + Wl I < 1 IVI I + I IWI I 
holds for all vectors V and W. 
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4-6, Geometric Considerations 



In Section 4-4, we saw that two parallel vectors determine a parallelogram. 
That is, if 



and B 



„ are two nonparallel vectors with initial points at the origin, then the points 
PKa,b), 0:(0,0), R:(c,d) and S: (a + c, b + d) are the vertices of a 

parallelogram (Figure 4—14.) A reasonable question to ask is: "How can we 

determine the area of the parallelogram PORS?" 




Figure 4—14, A parallelogram determined by vectors. 



As you recall, the area of a parallelogram equals the product of the lengths 
of its base and its altitude. Thus, in Figure 15, the area of the parallelo- 
gram KLMN is bj^h, where is the length of side NM and h is the length 
of the altitude KD* 




Figure 4-15. Determination of the area of a parallelogram. 

[sec. 4-6] 

171 



162 

But if is the length of side NK, and 9 is the measure of either 

angle NKL or angle KNM, we have 

h = I sin 91 . 

Hence, the area of the parallelogram equals ^^^2 's^"^'' 

Returning to Figure 4-14 and letting 9 be the angle between the vectors 
A end B, we can now say that if G is the area of parallelogram PORS, then 

^ I lAI I IBI 1^ sin^ 9, 

Now 

2 2 
sin 9 =s 1 — cos 9. 

It follows f rom Theorem 4~"5 that 



cos 9 a 



(A o B) 



lAI 



IBI I 



therefore. 



Thus, we have 



, 2 ^ IIAII^ IIBII^ - (A • B)^ 

sin 9 = 2 2 

I lAI I I IBI I 



G^ I lAII^ 1 IBI 1^ - (A • B)^. 



It follows from the result of Exercise 14 of the preceding section that 

G^ » (ad - bc)^. 

Therefore, 

G » lad - be I • 
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is the value of the determinant &(D), where D is the 
For easy reference, let us write our result in the form of a 



Theoreiu 4—7 . The area of the parallelogram determined by the standard 

a^ 
b 



.-representation of the vectors 
a c 



and 



equals 16(D) I , where 



b d 



Corollary 4—7—1 , 
if 6(D) = 0. 



The vectors 



and 



are parallel if and only 



The argument proving the corollary is left as an exercise for the student. 
You notice that we have been led to the determinant of a 2X2 matrix in 

examining a geometrical interpretation of vectors. The role of matrices in this 

interpretation will be further investigated in Chapter 5. 

From geometric considerations, you know that the cosine of an angle cannot 

exceed 1 in absolute value. 



I cos 01 < 1, 



and that the length of any side of a triangle cannot exceed the sum of the 
lengths of the other two sides, 

OQ < OP + PQ, 

Accordingly, by the geometric inteirpretation of column vectors, for any 



we must have 



and 



and W 



V • W 

^"72 1/2 



< 1 



(1) 



I IV + Wl I < I IVII + I IWI I • 



(2) 
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But can these inequalities be established algebraically? Let us see. 
The inequality (1) is equivalent to 



that is, 



(V • W)^ < (V • V)(W • W), 



'(ac + bd)^ < (a^ + b^)(c^ + d^), 



or — as we see when we multiply out and simplify — to 



2 2 2 2 
2 abed < a d + b c . 



But this can be written as 



0 < (ad - bc)^, 



which certainly is valid since the square of any real number is nonnegative. 
Since ad — be = &(D), where 



a c 
b d 



]• 



you can see that the foregoing result is consistent with Corollary 4-7—1, above 
that is, the sign of equality holds in (1) if and only if the vectors V and 
W are parallel. 

As for the inequality (2), it can be written as 



V(a + c)^ + (b + d)^ < V^a^ + + v^c^ + d'^, 
which simplifies to 



/"? 2 2 
2 ac + 2 bd < 2 Va + b Vc + d , 



which again is valid since 

0 < (ad bc)'^. 
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This time, the sign of equality holds if and only if the vectors V and W 
are parallel and 



ac + bd > 0, 

^that is, if and only^if the vectors V and W are parallel and in the same 
direction. 

If you would like to look further into the study of inequalities and their 
applications, you might consult the SMSG >tonograph, "An Introduction to 
Ineqmlities," by E. F. Beckenbach and R. Bellman. 



Exercises 4—6 



1. Let OP represent the vector A, and OT the vector B. Determine the 
area of triangle TOP if 



B 



-1 

4 

1 

2 



B 



B 



-2 
-2 

3 
-2 



(a) A = 

(b) A = 

(c) A « 

2. Compute the area of the triangle with vertices: 



(a) 


(0,0), 


(1.3), 


and 


(-3.1); 


(b) 


(0,0), 


(5.2), 


and 


(-10,-^); 


(c) 


(1,0), 


(0,1), 


and 


(2.3); 


(d) 


(1.1). 


(2,2), 


and 


(0,5); 


(e) 


(1.2). 


(-1.3). 


and 


(1.0). 



3, Verify the inequalities 

(V • W)^ < (V • V)(W«W) 

I IV + Wl I < I IVI I + I IWI I 



and 



for the vectors 
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(a) V » (3,4) and W « (5,12), 

(b) V « (2,1) and W « (4,2), 

(c) V » (-2,-1) and W « (4;2). 



4—7* Vector Spaces and Subspaces 

Thus far our discussion of vectors has been concerned essentially with 
individual vectors and operations on them. In this section we shall take a 
broader point of view. 

It will be convenient to have a symbol for the set of 2x1 matrices. 
Thus we let 



where R is the set of real numbers. The set H together vith the operations 
of addition of vectors and of multiplication of a vector by a real number is an 



Definition 4-4 . A set of elements is a vector space over the set R of 
real numbers provided the following conditions are satisfied: 



(a) The sum of any two elements of the set is also an element of the 
set* 

(b) The product of any element of the set by a real number is also an 
element of the set. 

(c) The laws I and II of Theorem 4-2 hold. 



In applying laws I and II, 0 will denote the zero element of the vector 
space. Let us emphasize, however, that the elements of a vector space are not 
necessarily vectors in the sense thus far discussed in this chapter; for ex- 
ample, the set of 2x2 matrices, together with ordinary matrix addition and 
multiplication by a number, forms a vector space. 

Since a vector space consists of a set together with the operations of 
addition of elements and of multiplication of elements by real numbers, strictly 
speaking we should not use the same symbol for the set C elements and for the 
vector space. But the practice is not likely to cause confusion and will be 




u e R and v € R 



example of an algebraic system called a vector space . 
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followed. 

A completely trivial example of a vector space over R is the sen con- 



sisting of the zero vector 
set of vectors parallel to 



alone. Another vector space over R is the 
that is, the set 



[3] 



r G R 



It is evident that we are concerned with subsets of H in these two examples. 
Actually, these subsets are subspaces , in accordance with the following defi- 
nition, ^ 

Defi nition 4-5 . Any nonempty subset F of H is a subspace of H 
provided the following conditions are satisfied: 

The sum of any two elements of F is also an element of F. 

The product of any element of F by a real number is an element of F. 

By definition, a subspace must contain at least one element V and also 
must contain each of the products rV for real numbers r. Every subspace of 
H therefore has the zero vector as an element, since for r « 0 we have 



OV 



It is easy to see that the set consisting of the zero vector alone is a subspace. 
We can also verify that the set of all vectors parallel to. any given nonzero 
vector is a subspace. Other than H itself, subsets of these two types are 
the only subspaces of H. 

Theorem 4-8 . Every subspace of H consists of exactly one of the follow- 
ing: the zero vector; the set of vectors parallel to a nonzero vector? the 
space H itself. 

Proof , If F is a subf.pace containing only one vector, then 
since the zero vector belongs to every subspace. 
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If F contains a nonzero vector V, then F contains all vectors rV 
for real r. Accordingly, if all vectors of F are parallel to V, it follows 
that 



F « |rV: r € rJ 



If F also contains a vector W not parallel to V, then F is actually 
equal to H, as we shall now prove. 
Let 



and 



be nonparallel vectors in the aubapace F, and let 



be any other vector of H. We shall show that Z is a member of F. 

By the definition of subspace, this will be the case if there are numbers 
X and y such that 



xV + yW « Z, 



(1) 



that is 9 




These can be found if we can solve the system 

i 

ax + cy » r, 
bx + dy " s, 

for X and y in terms of the known numbers, a, b, c, d, r, and s. But 
since V and W are not parallel, it follows (see Corollary 4-7-1) that 
ad - be 9* 0 and therefore the equations have the solution 



dr — cs 
ad — be 



and y 



as — br 
ad be 
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Since F is a subspace that contains V and W, it contains xV, yW, and 
their sum Z. Thus every vector Z of H must belong to F; that is, H is 
^.i subset of F. But F is given to be a subset of H. Accordingly, F = H. 

Using the of Section 4-4, we can give a geometric interpretation to 

Equation (1). Let lLs nonparalLel vectors V and W in the subspace F have 
standard representa'uions OP and OR, respectively; see Figure 4-16. Let Z 
be represented by OT. Since OP and OR are not parallel, any line p:^rallel 
to one of them must intersect the line containing the other. Draw the lines 
through T parallel to OP and OR,, and let S and Q be the points in 
which these lines intersect the lines containing OR and OP respectively. 
Then 

or «= OQ + OS. 




Figure 4-16. Representation of an arbitrary 
vector Z as a linear combination of a given 
pair of nonparallel vectors V and W. 

But OQ is parallel, to OP and OS to OR. Therefore, there are real numbers 
X and y such that 

OQ = xOP and OS = yOR. 

Hence, 
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Z = xV + yW, 



(1) 



This eiius our discussion of Theorem 4-7 and introduces the important concept 
of a linear combination : 

Definition 4-6 > If a vector Z can be expressed in the form xV + yW, 
v/here x and y are real numbers and V and W are vectors, then Z is 
called a linear combiuatj on of V and W, 

Further, we have incidentally established the useful facts stated in the 
following theorems: 

Theorem 4-9 . A subspace F contains every linear combination of each 
pair of vectors in F, 

Theorem 4-10 . Each vector of H can be expressed as a linear combination 
of any given pair of nonparallel vectors in H, 
For example, to express 



Z = 



5 
10 



as a linear combination of 



V = 



and W = 



-3 
4 



we must determine r^l numbers x and y such that 



5' 




" 4' 




"-3' 


10 


= X . 


3 


+ y 


4 



4x - 3y 
3x + 4yJ * 



Thus, we must solve the set of equations 

5 = 4x - 3y, 
10 = 3x + 4y. 

We find the unique solution x - 2 and y = 1; that is, we have 
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If you observe that the given vectors V and W in the foregoing example 
are orthogonal, that is, V • W 0 (see Coroilary 4-5-2 on page 156) then a 
second method of solution may occur to you. For if 



Z = aV + bW, 



then for the products Z • V and Z o W you have 



But 



Hence, 



Z • V ^ aWVW^ and Z«W = b|lWII^, 



V = 50, Z • W = 25, I IVI 1^ = 25, and ! IWI 1^ = 25. 



50 = 25a and 25 = 25b , 

and accordingly 

a = 2 and b = 1 . 

It is worth noting that the representation of a vector Z as a linear 
combination of two given nonparallel vectors is unique; that is, if the vectors 
V and W are not parallel, then for each vector Z the coefficients x and 
y can be chosen in exactly one way (Exercise 4—7—11, below) so that 

Z = xV + yW. 

The pair of nonparallel vectors V and W is called a basis for H, while the 
real numbers x and y are called the coordinates of Z relative to that 
basis. In the example above, the vector | ,^1 has coordinates 2 and 1 

and [ 4J " 

In particular, the pair of vectors \q'\ [i] called the 

natural basis for H. This basis allows us to employ the coordinates of the 
point (u,v) associated with the vector V = as the coordinates relative 

to the basis; thus, 
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u 




* l' 




"o" 




ss u 


0 


+ V 




V 






1 

- 



Since every vector of H can be expressed as a linear combination of any 
pair of basic vectors, the basis vectors are said to span the vector space. 
The minimal number of vectors that span a vector space is called the dimension 
of the particular space. 

For example, the dimension of the vector space H is 2. In the same 
sense, the set F 





'2' 


; r e Rj 




3 




L 





" 1 




"0' 


0 


nor 


1 



is a subspace of dimension 1. Note that neither ^ nor , is a basis 
for this subspace. (What is?) 

In a 2— space, that is, a vector space of dimension 2, it is necessary that 

any set of basis vectors be linearly independent. 



Definition 4—7 . Two vectors V and W are linearly independent if and 
only if, for all real numbers x and y, the equation 

xV + yW = 0 



implies x = y = 0. Otherwise, V and W are said to be linearly dependent . 

_ 2 

and W = 



For example, let V = 



^/2 
-3 



3 



Since 







2 ' 
3 


-3 


+ 3 





/2 

V and W are linearly dependent. Note that 



= 0, 



which indicates that V and W are parallel. 
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Exercise 4—7 

Express each of the following vectors as linear combinations of 
, and illustrate your answer graphically: 



[3] 



and 



(a) 
(b) 
(c) 



1 

-2 

0 
-3 



'(d) 
(e) 
(f) 



(g) 
(h) 
(i) 



[-1 
[0]. 



~3 
4 



In parts (a) through (1) of Exercise 1, detemine the coordinates of each 
of the vectors relative to the basis 



2" 




"2' 




and 




-1 


4 



Prove that the following set is a subspace of H: 



S = 



: r e R 



Prove that, for any given vector W, the set {rW : r e R] is a subspace 
of H. 



For 



V ^ 



determine which of the followini;; subsets of H are subspaces: 

(a) all V \;ith u « 0, (d) all V with 2u - v = 0, 

(b) all V with v equal (e) all V with u + v « 2, 
to an integer, 

(c) all V with u rational, (f) all V with uv = 0 . 

Prove that F is a subspace of H if and only if F contains every 
linear combination of two vectors in F. 



Show that 
vectors 



cannot be expre£3sed as a linear combination of the 



hi 





6 


and 


-15 



Describe the set of all linear combinations of two given parallel vectors, 
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9. Let and be subspaces of H. Prove that the set F of all 

vectors belonging to both F^^ and F^ is also a subspace . 

10. In proving Theorem 4-10, we showed that if V and W are not parallel 
vectors, then each vector of H can be expressed as a linear combination 
of V and W. Prove the converse: If each vector of H has a rep re— . 
sentation as a linear combination of V and W, then V and W are not 
parallel • 

11. Prove that if V and W are not parallel, then the representation of any 
vector 2 in the form aV + bW is unique; that is, the coefficients 

a and b can be chosen in exactly one way. 

12. Show that any vector 

• \ 

u 

V 

w 

can be expressed uniquely as a linear combination of the basis vectors 



3 




2 




-2 


2 


> 


-1 


, and 




-1 




1 




5 



4~6 . Summary 

In this chapter, we have developed a geometrical representation - — namely, 
directed line segments — for 2x1 matrices, or column vectors. Guided by 
the definition of the algebraic operation of addition of vectors, we have found 
the "parallelogram law of addition" of directed line segments. The multiplica- 
tion of a vector by a number has been represented by the expansion or contraction 
of the corresponding directed line segment by a factor equal to the number* with 
the sign of the factor determining whether or not th.:: cLl ...ction of the line 
segment is reversed. Thus, from a set of algebraic elements we have produced a 
set of geometric elements. Geometrical observations in turn led us back to 
additional algebraic concepts. 

Also in this chapter, we have introduced the important concepts of a vector 
space and linear independence. 

Since the nature of the elements of a vector space is not limited except 
by the postulates, the vector space does not necessarily consist of a set whose 
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elements are 2x1 column vectors; thus its elements might be n x n matrices, 
real numbers, and so on. 

For example let us look at the set P of linear and constant polynomials 
with real coefficients, that is, the set 



under ordinary addition and m :iplication by a number- The sum of two such 
polynomials , 



is an element of the set since the sums a^^ + a^ and + b^ are real numbers. 
The product of any element of P by a real number. 



is also a member of P since the products ac and be are real numbers, We 
can similarly show that addition is commutative and associative, that there is 
an identity for addition, and that each elemer" has an additive inverse; thus, 
the laws I of Theorem 4.2 are valid. In like fashion, we can demonstrate that 
laws II are satisfied: both distributive laws nold; the multiplication of an 
element by two real numbers is associative; the product of any element p by 
the real number 1 is p itself; the product of 0 and any element p is the 
zero element; and the product of any real number and the zero element is the 
zero element . 

We have outlined the proof that the set P of linear and constant poly- 
nomials is a vector space. Thus the expression, "the vector, ax + b," is 
meaningful when we are speaking of the vector space P. 

The mathematics to which our algebra has led us forms the beginnings of a 
discipline called "vector analysis," which la an important tool in classical 
and modern physics, as well as in geometry. The "free" vectors that you meet 
in physics, namely, forces, velocities, etc., can be represented by our geometric 
vectors. The study in which we are engaged is consequently of vital importance 
for physicists, engineers, and other applied scientists, as well as for mathe- 
maticians • 




p: p(x) « ax + b, a,b € R 




(aj^x + h^) + (a^x «f b^) = (a^^ + a^)x + (b^^ + b^), 



c(ax + b) = acx + be. 
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Chapter 5 
TRANSFORMATIONS OF THE PLANE 

5-1. Functions and Geometric Transformations 

You have discovered that one of the most fundamental concepts in your study 
of mathematics is the notion of a function. In geometry the function concept 
appears in the idea of a transformation. It is the aim of this chapter to recall 
what we mean by a function, to define geometric transformation, and to explore 
the role of matrices in the study of a significant class of these transformations. 

You recall that a function from set A to set is a correspondence or 
mapping from the elements of the set A to those of the set B such that with each 
element of A there is associated exactly one element of B. The set A is 
the domain of the function and the subset of B onto which A is mapped is the 
range of the function. In your previous work, the functions you met generally 
had sets of real numbers both for domain and for range. Thus the function 
symbolized in the form 

2 

X — > X 

2 

is likely to be interpreted as associating the nonnegative real number x with 
the real number x. Here you have a simple example of a "real function" of a 
"real variable." 

In Chapter 4, however, you met a function V — > 1 I VI I having for its 
domain the vector space H, and for its range the set of nonnegative real 
numbers . 

In the present chapter, we shall consider functions that have their range 
as well as their domain in H. ,jecif ically , we want to find a geometric in- 
terpretation for these "vector functions" of a "vector variable"; this is a 
continuation of the discussion started in Chapter 3. All vectors will now be 
considered in their standard representations OP, so that they will be parallel 
if and only if represented by collinear geometric vectors. 

Such a vector function will associate, with the point P having coordinates 
(x,y), a point P' with coordinates (x*,y'). Or we may say that it maps the 
geometric vector OP onto the geometric vector OP* . The function can, there- 
fore, be viewed as a process that associates with each point P of the plane 
some point P' of this plane. We shall call this process a transformation of 
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the plane into itself or a geometric transformation . As a matter of fact, 
these transformations are often called "point transformations'* in contrast to 
more general mappings in which a point may be carried into a line, a circle, or 
some other geometric configuration. For us, a geometric transformation is a 
helpful means of visualizing a vector function of a vector variable. As a 
matter of ct- * mlent terminology, we shall call the vector that such a function 
associates with a given vector V the image of V; furthermore, we shall say 
that the function maps V onto its image. 
Let us look at the simple function 

V — ^ 2V, V € H. 

This function maps each vector V onto the vector that has the same direction 
as V, but that: is twice as long as V. Another way of asserting this is to 
say that the function associates with each point P of the plane a point P' 
such that P and P* lie on the same ray from the origin, but 

I lOP* 1 I = 21 lOPI 1 ; 

see Figure 5—1. You may therefore think of the function in this example as 
uniformly stretching the plane by a factor 2 in all directions from the origin. 
(Under this mapping, what is the point onto which the origin is mapped?) 
As a second example, consider* the function 

V — > - V, V e H. 

This time, each vector is mapped onto the vector having length equal and direction 
opposite to that of the given vector. Viewed as a point transformation, the 
function associates with any point P its "reflection'* in the origin; see 
Figure 5~2 . 

The function >. 

V — ^ - 2V 

combines both of the effects of the preceding functions, so that the vector 
associated with V is twice as long as V, but has the opposite direction to 
that of V. 
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Figure 5-1 . The trans- 
formation V — > 2V. 



Figure 5-2. The trans- 
formation V — i> —V, 



Now, let us look at the function 



V 



! IVI I V. 



As in our first example, each vector is mapped by the function onto a vectror 
having the same direction as the given vector. Indeed, every vector of length 
1 is its own image. But if IIVM > 1, then the image of V has a length 
greater than that of V, with the expansion factor increasing with the length 
of V itself. Thus, the vector • 



2 
0 



having length 2, is mapped onto 



4 
0 



which is twice as long. The vector 



5 
12 



whose length is 13, has the image 
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65 
156 



with length 169. On the other hand, for nonzero vectors of length less than 
1, we obtain image vectors cf shorter Iringth, the contraction factor decreasing 
with decreasing length of the original vector. Thus, 



is mapped onto 



the image being half as long as the given vector. Again, the vector 



is mapped onto 



20 
49 

49 



the length of the first vector being 5/7, while the length of its image is 
only (5/7)^, or 25/49. Although we may try to think of this mapping as a 
kind of stretching of the plane in all directions from the origin, so that any 
point and its image are collinear with the origin, this mental picture has also 
to take into account the fact that the amount of expansion varies with the 
distance of a given point from the origin, and that for points within the circle 
of radius 1 about the origin the so-called stretching is actually a compression. 

We have been considering transformations of individual vectors; let us look 
at certain transformations of the square ORST, determined by the basis vectors 
, As shown in Figure 5-3, the function 

V — > -V 



and 



maps 



'r 




' o' 


0 


and 


1 



respectively, onto 

and 
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Figure 5—3. Reflection in the origin. 

Another transformation that is readily visualized is the reflection in the 
X axis. For this mapping, the point (x,y) goes into the point (x,-y); 
see Figure 5—4. 



(x,y) 



• (x,-y) 



Figure 5—4. Reflection in the x axis. 
That is, the map is given by 



X 




X 


y 







Using a matrix, we may rewrite this result in the form 
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i 0 
0 -1 



as is easily verified. 

Now this transformation, applied to the square ORST, leaves thepoint 



(1,0) unchanged; thus the vector 



is mapped onto 



0 
-1 



is mapped onto itself. The vector 



Reflection in the y axis, or a rotation of 180° in space about the 
y axis, can expressed similarly: 



r 














X 






SI 






[ 










0 







as shown in Figure 5-5. 





Figure 5-5. Rotation of 180 about the axes. 

Casual observation (see Figures 5—5 and 5-6) may lead you to assume that 
a 180° rotation about the y axis and 90° rotation about the origin in 
the (x,y) plane are equivalent; they are not. The first transformation 
leaves the point (0,1) unchanged, whereas the second transformation maps 
(0,1) onto (—1,0). As a vector function, the 90° rotation with respect to 
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che origin i i exp^^^^^cd 





Fi^i>^^ 5^6' R^^^tion of 90^ about the origin. 

Th^ trans fot^^Ws °^ FiSU^es 5-2 through 5-6 have altered neither the 
size nof the shaj?^ t^^e sq^^re^ "stretching" function of Figure 5-1, 

does alt^^" size, yv ''^h^^^" th^t j^qvbs each point parallel to the' x axis 
through a distan(>^ tv?ic^ ^.j^^e ordinate of the point alters shape. Con- 

sider the trans f(^^^:toP 











' 1 2' 




2t 










0 1 




y 



which m^Ps the b^^l^ ^^^ctor^ c?nto 



and 



respectively , 



The resi^^t is a j^l^^^^lngi ^^^^ ^^^nsf^^^ms the square into a parallelogram; see 
Figure 5-7, What: the ^tt^t^\^^^s do to shape? The shear to size? 
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Figure 5-7 . Shearing . 



Another type of transformation involves a displacement or "translation" 
in the direction of a fixed vector. The mapping 



V + U, where U = 



can be written in the form 



X 




X 


+ 


3" 








+ 


2 



One way of visualizing this function is to regard it as translating the plane in 
the direction of the vector U through a distance equal to the length of U. 

Transformation of two different types can be combined in one function. For 
instance, the mapping 



•|(V + U), where 



involves a translation and then a compression. When the function is expressed 
in the form 







X + 


3 


X 




2 




y 




y + 

2 





we recognize more easily that every noint P is mapped onto the midpoint of 
the line segment joining P to the point (3,2); see Figute 5-8. 
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Figure 5-8. The transformation V — > i(V + U) . 

Under this mapping, the square ORST will likewise be translated 
toward the point (3,2) and then compressed by a factor ~ , as shown in 
Figure 5--9 . This figure enables us to see that the points 0, R, S, and T 
are mapped onto midpoints of lines connecting these points to (3,2). 







/ 

/ 

/ 




0 


\ 



Figure 5-9. Translation and compression. 



All the vector functions discussed above map distinct points of the plane 
onto distinct points . This is not always the case; we can certainly produce 
functions that do not have this property. Thus, the function 
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'o' 

maps every point of the plane onto the origin. On the other hand, the trans- 
formation 



X 




X 






0 



maps the point (x,y) onto the point of the x axis that has the same first 
component as V. For example, every point of th^ line x = 3 is mapped onto 
the point (3,0) . Since the image of each point P can be located by drawing 
a perpendicular line from P to the x axis, we may think of P as being 
carried or projected on the x axis by a line perpendicular to this axis. Con- 
sequently, this mapping may be described as a perpendicular or orthogon al 
projection of the plane on the x axis. You notice that these last two functions 
map H onto subspaces of H. 

Since we have met examples of transformations that map distinct points onto 
distinct points and have also seen transformations under which distinct points 
may have the same image, it is useful to define a new term to distinguish be- 
tween these two kinds of vector functions. 

Definition 5-1 . A transformation from the set H onto the set H is one^ 
to-one provided that the images of distinct vectors are also distinct vectors. 

Thus, if f is a function from H onto H and if we write f(V) for 
the image of V under the transformation f, then Definition 5-1 can be 
formulated symbolically as follows: The function f is a one-to-one transforma- 
tion of H provided that, for vectors V and U in H, 



implies 



V -1* U 
f(V) 7« f(U) 



Exercises 5—1 



1. Find the image of the vector V under the mapping 

195 

V — > 3V 
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for each of the following values, of V: 

si , V [o 

1 ' 0 



2. 



(a) I , (c) 

(b) _i . (d) 
Find f(V) under the mapping 



(e) 
(f) 



[-3]- 
[-'3] • 



f : V = 



for each of the following values of V: 



(a) 



(b) 



(c) 
(d) 



[?]• 



(e) 5 

(f) -2 



-1 
-3 

2 
0 



3. Describe the geometric effect of each of the following transformations of 

X 



H on the vector V = 

(a) V V, 

(b) V 

(c) V — > aV, a > 0, 
>• - aV, a > 0, 



(d) V 

(e) V 

(f) V 

(g) V 



0 

y 

y 
y 

— X 

y 



(h) V 

(i) V 
(j) V 
(k) V 
(1) V 
(m) V 

(n) V 



X 




-y 


» 










3x" 








X + y 



X 

2x + y 
X - 2y 

y 

y — 3x 



4, Determine which of the transformations in the preceding exercise are one- 
to-one • 

5, Find expressions of the type V — > V for the transformations of H that 
map each point P onto the point P' related to P in the ways described 
below: 
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(a) P' is one unit to the right of P and four units above P;, 

(b) P' is the perpendicular projection of V on the horizontal line 
through (3,2); 

(c) P* is the perpendicular projection of P on the vertical line 
through (-1,-2); 

(d) OP and 5p' are collinear but opposite in direction, and 
1 lOP' I 1 « i I lOPl I ; 

(e) P' is the intersection of the horizontal line through P with 
the line of slope -1 passing through the origin (horizontal 
prcjection on the line y = - x) ; 

(f) P* is the intersection of the vertical line through P with 
the line y = 2x (vertical projection on the line y « 2x) • 

Show that the mapping of H into itself that sends each point P into the 
point of intersection of the line y = x with the line through P having 
slope 2 is given by 



X 




2x — y 


y 




2x - y 



(a) Show that the mapping 





X 




X -f 2y 








4x -f 3y 



can be expressed in the form 



1 2 
4 3 



(b) Find the image under this transformation of 



(c) Find the image under this transformation of the subspace of 

'^l 



vectors collinear with 
8. Solve parts (b) and (c) of Exercise 7 when 

(a) \l\ . M 

m 

(b) I . 1 an W 



is replaced by 
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1 
-1 

1 

2 
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9. Under the transformation given in Exercise 7, find by two different methods 
the image of each of the following vectors: 



(a) 
(b) 
(c) 

10. Consider the mapping 



4" 




' 2" 


2 


« 2 


1 


'5' 




2 


4 


83 


1 


3 




2 


2 ^ 




1 



(d) 
(e) 
(f) 



1 

v/2 



1 1 
1 



(a) Find the images under this mapping of the pair of points (5,1) 
and (1,-2), and show that the distance between the given pair 
of points equals the distance between their images. 

(b) Solve part (a) if the given points are (-2,10) and (6,-5). 

(c) Solve part (a) if the given points are (a,b) and (c,d). 



5-2. Matrix Transformations 

As noted earlier, especially in Chapter 3, the pair of equations 



X + a,^ y t= b^, 



22 



can be written in the form 



where 



AV = B, 



^1 ^12 
^21 ^22 



V « 



, and B « 



Consequently, in solving the equations you actually determine all the vectors V 

that are mapped onto the particular vector B by the function 
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V — > AV. 



(1) 



The study of the solution of systems of linear equations thus leads to the 
consideration of the special class of transformations on H that are expressible 
in the form (1), where A is any 2 x 2 matrix with real entries. These matrix 
transformations constitute a very important class of mappings, having extensive 
applications in mathematics, statistics, physics, operations research, and 
engineering. 

An important property of matrix -transformations is that they are Unear 
mappings ; that is, they preserve vector sums and the products of vectors with 
real numbers • 

Let us formulate these ideas explicitly. 



Defini 



tion 5-2. A linear transformation on H is a function f from H 



into H such that 

' (a) for every pair of vectors V and U in H, we have 

f(V + U) = f(V) + f(U); 

(b) for every real number r and every vector V in H, we have 

f(rV) « rf(V). 

Theorem 5-1. Every matrix transformation is linear. 

P roof . Let f be the transformation 

f : V — > AV, 

where A is any real matrix of order 2. We must show that for any vectors V 
and U, we have 

A(V + U) AV + AU; 

further, we must show that for any vector V and any real number r we have 

A(rV) « r(AV). 
[sec. 5^2] 
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But tliese equalities hold in virtue of parts III (a) and m (f) of Theorem 4-2 
(see page 134) . 

The linearity property of matrix transformations can be used to derive the 
following result concerning transformations of the subspaces of H. 

Theorem 5—2 . A matrix A maps every subspace F of H onto a subspace 
F' of H. 

Proof, Let F' denote the set of vectors 



{AU: U e F). 

To prove that F' is a subspace of H, we must show that the following state- 
ments are true: 

(a) For any pair of vectors P' , Q' in F' , the sum P' + Q' is 
in F' . 

(b) For any vector P' in F' and any real number r, rp' is in 
F' . 

If P' and Q' are in F', then they must be the images of vectors P 
and Q in F; that is, 

P' « AP, 
Q' « AQ. 

It follows that 

P" + Q' = AP + AQ = A(P + Q), 

and P' + Q' is the image' of the vector P + Q in F. (Can you tell why 
P + Q is in F?) Hence, (P' + Q') € F'. Similarly, 

rP' = r(AP) « A(rP), 

"and hence rP' is the image of rP. But rP e F because F is a subspace. 
Thus, rP' is the image of a vector in F; therefore, rP' € F' . 



Corollary 5~-2-«l , Every matrix maps the plane H onto a subspace of H, 
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either the origin, or a straight line through the origin, or H itself. 
For example, to determine the subspaces onto which 



4 2 
2 1 



maps 

(a) F 

(b) H itself, 

we proceed as follows. 

For (a), the vectors of F are of the form 



F » ' 




X 


: y - 3x 






y 





u = 



X 




l" 


-3x 


» X 


-3 



, X 6 R. 



Hence , 



AU 



4 2 
2 1 



f [-3])" ^ 2 l\ [-3 



-2 
-"1 



— X 



Thus, F is mapped onto F', the set of vectors collinear with 



; that is, 



In other words, A maps the line passing through the origin with slope -3 onto 
the line through the origin with slope 1/2. 

As regards (b) , w note that for any vector 



V = 



6 H, 



we have 



AV 





"4 2" 




X 




4x + 2y 


s 


2 1 




.y_ 


St 


2x + y 



(2x + y) 



Since 2x + y assumes all real values as x and y run over the set of real 
numbers, it follows that H is also mapped onto F' ; that is, A maps the 
entire plane onto the line 201 
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Exercises 5-2 



1 2 
4 3 



L. Let A = 

(a) V = 

(b) V = 

(c) V = 
determine: 



1 

2 

-3 
-2 

0 
2 



For each of the following values of the vector V, 
(d) V = 



5 
-1 



(e) V = 

(f) V = 



3 
0 

-6 
4 



(i) the vector into which k maps V, 
(ii) the line onto which A maps the line containing V. 

2. A certain matrix maps 



into 



, and 



into 





3 




2 




1 




3 




1 




2 



Using this information, determine the vector into which the matrix maps 
each of the following: 



(a) 
(b) 

(c) 
(d) 



[51. 

HI' 



-2 
2 



(e) 
(f) 
(g) 



[3]. 

[3l- 



3. Consider the following subspaces of H: 



^1 = 





0' 




0 




X 




y. 



S3 « 




X 


: y = 2x 






, y , 





y = 



2x 



F, = H itself. 
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Determine the subspaces onto which Fj^, F^, and are mapped by each 

of the following matrices: 



(a) A 



-2 1 
-2 1 



(B) B 



0 1 
-1 3 



(c) AB, (d) BA. 



Let A = 



1 1 

C 1 



(a) Calculate AV for 



[-;] ■ [;l 



-1 
^1 



(b) Find the vector V for which 



AV 



-1 
1 



1 
-1 



Determine which of the following transformations of H are linear, and 
justify your answer: 



(a) V « 

(b) V- 

(c) V - 



X 




'x + 1 


y 




; y . 



2x 

5y 



X - y 
X + y 



(d) V 
(e) 

(f) V — > I IVI I V 



(V + U) , where U « 



Prove that the matrix A maps the plane onto the origin if and only if 



0 0 
0 0 



Prove that the matrix A maps every vector of the plane onto itself if and 
only if 



1 0 
0 1 



Prove that 



2 1 
0 1 
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maps the line y « 0 onto itself. Is any point of that line ma >p id onto 
itself by this matrix? 

(a) Show that each of the matrices 



1 0 

0 0 



and 



1 2 
0 0 



maps H onto the x axis* 

(b) Determine the set of all matrices that map H onto the x axis. 
(Hint: You must deteirmine all possible matrices A such that 
corresponding to each V € H there is a real number r for 
which 



AV = r 



(1) 



In particular, (1) must hold for suitable r when V is replaced 



by 



and by 



.) 



Determine the set of all matrices that map H onto the y axis, 
(a) Determine the matrix A such that 

AV = 2V 

for all V. 

(b) The mapping 



aV 



(a > 0) 



multiplies the lengths of all vectors without changing their directions. 
It amounts to a change of scale. The number a is Hccordingly called a 
scale factor or scalar. Find the matrix A that yi<slds only a change of 
scale: 

AV = aV. 



Prove that for every matrix A the set F of all vectors U for which 



AU = 

[sec. 5-2] 



204 



196 

la a subspace of H. This subspace is called the kernel of the mapping. 
13. Prove that a transformation f of H into itself is linear if and only if 

f(rV + sU) « r f(V) + s f(U) 

for every pair of vectors V and U of H and every pair of real numbers 
r and s • 



5—3. Linear Transformations 

In the preceding section, we proved that every matrix represents a linear 
transformation of H into H. We now prove the converse: Every linear trans- 
formation of H into H can be represented by a matrix. 

Theorem 5-3 * Lf;t f be a linear transformation of H into H. Then, 
relative to any given basis for H, there exists one and only one matrix A 
such that, for all V e H, 

AV « f (V) . 

Proof. We prove first that there cannot be more than one matrix represent- 
ing f . Suppose that there are two matrices A and B such that, for all 
V € H, 

AV « f (V) and BV « f (V) . 

Then 

AV - BV « f (V) f(V) 

for each V. Hence, 

(A - B)V « ^q] for all V € H. 

Thus, A - B maps e^^ery vector onto the origin. It follows (Exercise 5-2—6) 
that A - B is the zero matrix; therefore, 

A - B. 
[sec. 5-2] 
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Hence, there Is at most one matrix representation of £• 

Next, we show how to find the matrix representation for the linear transfot 
nation f* Let and be a pair of noncolllnear vectors of H« Let 



f(sp 



II 



and 



'12 
*22 



be the respective images of Sj^ and under the mapping f . If V is any 

vector of H, it follows from Theorem 4—10 that there exist real numbers 
and V2 such that V = ^i^i ^2^2* ^^"'^^ ^ is a linear transformation, we 
have 



f(V) = f(v^S^ + v^S^) = v^f(S^) + V2f(S2), 



Accordingly, 



f(V) = V. 



Thus, 



'11 
'21 



+ v„ 



f(V) 



12 
«22 



^1 ^2 
^21 ^22 



^1^ + ^2^2 
^2l\ + «22^2 



It follows that f Is represented by the matrix 



^I ^2 
^21 ^22 



when vectors are expressed In terms of their coordinates relative to the basis 

You notice that the matrix A is completely determined by the effect of f 
on the pair of noncolllnear vectors used as the basis for H, llius, once you 
know that a given transformation on H is linear, you have a matrix represent- 
ing the mapping when you have the Images of the natural basis vectors, 



I 




0' 




0^ 


and 


I 


• 



For example, it can be shown by a geometric argument that the counterclock- 
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wise rotation of the plane through an angle of 30 about the origin is a linear 
transformation. This function maps any point P onto the point P' , where the 
measure of the angle POP' is equal to 30° (Figure 5-10). It is easy to see 
(Figure 5-11) that 




Figure 5-10* A rotation through 
an angle of 30° about the origin. 



Figure 5-11. The images of the points 
(1,0) and (0,1) under a rotation of 30^ 
about the origin. 



is mapped onto 



cof 30 
sin 30^ 



and 



is mapped onto 



-sin 30 
cos 30^ 



Thus, the matrix representing this rotation is 



A " 



cos 30° -sin 30° 



sin 30° cos 30° 



V3 1 
2 2 

2 2 



Note that the first column of A is the vector onto which 

I 0 

the second column of A is the image of 



is mapped; 



The product or composition of two transformations is defined just as you 
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define the composition of two real functions of a real variable. 
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Definition 5—3. If f and g are transformations on H, then for each 
vector V in H the composition transformations fg and gf are the trans- 
formations such that 

fg(V) « f(g(V)) and gf(V) » g(f(V)). 

Thus, to find the image of V under the transformation fg, you first 
apply g, and then apply f. Consequently, if g maps V onto U, and if 
f maps U onto W, then fg maps V onto W. 

The following theorem is readily prov i —3—7). 

Theorem 5—4 * If f is a linear transformation represented by the matrix 

A, and g is a linear transformation represented by the matrix B, then fg 

and gf are both linear transformations; fg is represented by AB, while gf 
is represented by BA* 

For example, suppose that in the coordinate plane each position vector is 
first reflected in the y axis ^ and then the resulting vector is doubled 
in length. Let us find a matrix representation of the resulting linear trans- 
formation on H. If g is the mapping that transforms each vector into its 
reflection in the vertical axis, then we hsve 



X 








-1 


0 




X 


_y 




y 




0 


1 




,y _ 



If f maps each vector into twice the vej:xor, then we have 



X 




X 


X3 


" 2 


0 




X 


y ^ 




^y , 




0 


2 




.y. 



Accordingly, the matrix representing fg is 



2 0 
0 2 



-1 0 
0 1 



-1 0 
0 2 
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Exercises 5-3 

!• Show that each of the mappings in Exercise 5-1-3 is linear, by determining 
matrices representing the mappings, 

2* Consider the linear transformations > 

p: reflection in the horizontal axis, 

q: horizontal projection on the line y = - x (Exercise 5-l-5e) , 
r: rotation counterclockwise through 90°, 

s: shear moving each point vertically through a distance equal to 
the abscissa of the point, 

of H into H. In each of the following, determine the r.^atrix represent- 
ing the given transformation: 

(a) p, (f) qp, (k) s(rs), 

(b) q, (g) pr, (1) (sr)s, 

(c) r, (h) rp, (m) p(sq), 

(d) s, (i) qs, (n) :ps)q, 

(e) pq, (j) sq, (o) (sp)(rq) . 

3* Let f be the rotation of the plane ai4unt€!«5cV,;>ckwise through 45° about 
the origin, and let g be the rotatic33s, cliuc^i^ise through 30°. Determine 
a matrix representing the rotation coant^in:-iaikwise through 15° about the 
origin . 

4. (a) Prove that ^swery linear trans fomatzios tft*ps the origin onto itself, 

(b) Prove that ewery linear trans fon^<atiar nmass every subspace of H onto 
a subspace of H. 

5. For every two linear transformations L md g on H, define f + g to 
be the transformation such that, for ¥ e H, 

(f + g)(V) - ff 4 tf^). 

Without using matrices, prove that : + g tk, a linear transformation on 

6. For each linear transformation f oi H am each real number a, define 
af to be the transformation such thac 



af(V) « fr^aV) 
[aec^ 5-3] 

209 



201 

Without using matrices, prove that af is a linear transformation on H. 

7. Prove Theorem 5-4. 

8. Without using matrices, prove each of the following: 

(a) f(g + h) = fg + fh, 

(b) (f + g)h = fh + gh, 

(c) f(ag) = a(fg), 

where f, g, and h are any linear transformations on H and a is 
any real number. 



5-4. One-to-one Linear Transformations 

The reflection of the plane in the x axis clearly maps distinct points 
onto distinct points j thus, the reflection is a one— to-^ne linear transformation 
on H. Moreover, the reflection maps any pair of noncollinear vectors onto a 
pair of noncollinear vectors. It is easy to show that this property is common 
to all one-to-one linear transformations of H intn itself. 

Theorem 3—5. Every one— to-one linear transformation on H maps noncollinea 
vectors onto noncollinear vectors. 

Proof . Let and be a pair of noncollinear vectors and let 

f(Sj^) = Tj^ and f(S^) « T^ 

be their images under the one-to-one linear mapping f . Since f is one-to- 
one, we know that Tj^ and T^ are not both the zero vector. We may suppose, 
therefore, that is not the zero vector. To show that and T^ are 

not collinear, we shall demonstrate that the assumption that they are co.Minear 
leads to a contradiction. 

If Tj^ and T^ are collinear, then there exists a real number r such 
that T^ = r Tj^. Now, consider the image under f of the vector r S^^ . Since 
f is linear, we have 

f(r S^) ^ r f(Sj^) 
a r Tj^ 
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Thus, each of the vectors r S^^ and is mapped onto T^. Since f is one- » 

to-one, it follows that 

r = S^. 

and therefore that and S2 are collinear vectors. But thi ' contradicts 

the fact that and S^ are not collinear. Hence, the assumption that 

and T2 are collinear must be false. Consequently, f must map noncollinear T 
vectors onto noncolllnear vectors. 

Corollary 5—5—1 * The pjbspace onto which a one— to-one linear transformation 
maps H is H itself. 

Proof . Since the subspace contains a pair of noncollinear vectors, the 
corollary follows by use of Theorems 4—9 and 4—10. 

The link between one— to-x)ne transformations on H and second-order 
matrices having inverses is given in th<: next theorem. 

Theorem 5-6. Let f be a linear transformation represented by the matrix 
A. Then f is one-to-one i and only if A has an inverse. 

Proof . Suppose that A has an inverse. Let Sj^ and S^ be vectors in 
H having the same image under f. Now, 

f(Sj^) « ASj^ and fCS^) = AS^. 

Thus, 

ASj^ = AS.^. 

A^'^AS^) « a'^^CAS^)* 
(a""^A)Sj^ » {r^A)S^, 
ISj^ « IS^, 
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Thus, f must be a one— to-one transformation. 

On the other hand, suppose that f is one-to— one . From Theorem 5-5, it 
follows that every vector in H is the image of some vector in H. In particu- 
lar, there are vectors W and U such that 



f (W) = AW = 



and 



f (U) = AU = 



Accordingly, the matrix having for its first column the vector W, and for 
its second column the vector U, is the xnverse of A. , 

Coro 1 la ry 5—6-1 > A linear transformation represented by the matrix A is 
one— to-one if and only if 

5(A) 9« 0. 

The theory of systems of two linear equations in two -variables can now be 
studied geometrically. Writing the system 



^11 ^ + ^ " 
^21 ^ + ^22^'' 



(1) 



in the form 



AV = U, 



(2) 



where 





^11 ^12 




X 


, and U = 


u 






> V = 






> 




/21 ^22. 




y 




V 




V 


that are 


mapped by 


the matrix A onto 


the vector U. 
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If 6(A) 9* 0, we now know that A represents a one-to-one mapping of H 
onto H. Therefore, A maps exactly one vector V onto U, namely, V = A U. 
Thus, the system (1) — or, equivalently , (2) — has exactly one solution. 

If 6(A) = 0, then, in virtue of Corollary 4—7-1, the columns of A must 
be collinear -vectors . Hence, A must have one of the forms 

[o 0 

[O 0 

wot botti a and b are zero. If A has the first of these forms, their 
A maps H onto the origin. In the other two cases., A maps H onto the 
line of vectors collinear with the vector j^^j . (See Exercise 5-4-7, belowJ) 
With these results in mind, you may now complete the discuss icn of the solution 
of equation (2) . 

Exercises 5—4 

1. Using ThpnT^ 5—6 or its corollary, determine which of the transformations 
in Exercise 5—1—3 are one— to— one . 

2. Show that ^ linear transformation is one-to-one if and only if the kernel 
of the mapping consists only of the zero vector. (See Exercise 5—2—12.) 

3. (a) Show that if f is a one-to-one linear transformation on H, then 
there exists a linear transformation g such that, for all V e H, 

gf(V) = V 



0 a 
0 b 



or 



a ra 
b rb 



and 

fg(V) « V. 

The transformation g is called the inverse of f and is usually written 
.-1 

g a f . 

(b) Show that the trans foirmat ion g « f ^ in part (a) is a one— to-one 
transformation on K. 

4. Prove that if f and g are one-to-one linear transformations of H, then 
fg is also a one— to— one transformation of H. 
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5. Prove that the set of one— to-one linear transformations on H is a group 
relative to the operation of composition of transformations. 

6. Show that if f and g are linear trans .i H such that fg 
is a one— to— one transformation, then both i ? one— to— one trana 
formations. 

7. (a) Show that if 5(A) = 0, then the matrix A maps H onto a point 
(the origin) or onto a line. 

(b) Show that if A is the zero matrix and U is the zero vector, 
then every vector V of H is a solution of the equation AV » U. 

(c) Show that if 5(A) = 0, but A is not the zero matrix, then the 
solution set of the equation 

AV« 

is a set of collinear vectors. 

(d) Show that if 6(A) = 0, but A is not the zero matrix,and U is 
not the zero vector, then the solution set of the equation 

AV « U 

either is empty or consists of all vectors of the form 

[V^ + tV2 : t € R), 
where Vj^ and V2 are fixed vectors such that 
AVj^ " U and AV^ « 

8. Show that if the equation AV » U has more than one solution for any given 
U, then A does not have an inverse. 




5-5. Characteristic Values anC Characteristic Vectors 

If we think of a mapping as '^carrying" points of the plane onto other points 
of the plane, we might ask, through curiosity, if there are cases in which the- 
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image point under a mapping is the same as the point itself. Such 'fixed' 
points I or vectors, are of great importance in mathematical analysis. 

Let us look at an example. The reflection with respect to the x axis, 



V, 



that is, 



[^1 



1 0 
0 -1 



X 



has the property of mapping each vector on the x axis onto itself; thus, 
each of these vectors is fixed under the transformation. 

Definition 5^ . If a transformation of H into itself maps a given vector 
onto itself, then t'.hat vector is a fixed vector for the transformation. 

More generally, we are interested in any vector that is mapped into a 
multiple of itself; that is, we seek a vector V e H and a number c e R such 
that 

AV = cV. 

Since the equation is automatically satisfied by the zero vector regardless of 
the value c, this vector is ruled out. 

The number c is called a characteristic value (or eigenvalue) of A, and 
the vector V a characteristic vector of A. These notions are fundamental in 
atomic physics since the energy levels of atoms and molecules turn out to be 
given by the eigenvalues of certain matrices. Also the analysis of flutter 
and vibration phenomena, the stability analysis of an airplane, and many other 
physical problems require finding the characteristic values and vectors of 
matrices . 

In Section 5-1, we saw that the mapping 



A = 



A 2 
2 1 



carried the plane H onto the line y = x/2. If we consider the set F 



F = 



: y = 2 ^ 
[sec* 5-51 
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under this same mapping, we see that F is mapped onto F' , the set of vectors 
collinear with F, Note that 



2 






*10t' 


1 






5t 



= 5 



and hence that 3 is a characteristic value associated with A> and 
a characteristic vector for any t € R, t ^ 0. 



is 



Definition 5-5 , Each nonzero vector satisfying the equation 

AV == cV 



is called a characteristic vector , corresponding to the characteristic value 
(or characteristic root ) c of A. 



of the equation 



Note that, as remarked above, the trivial solution, 
is not considered a characteristic vector. 

Because of the importance of characteristic values in' pure and applied 
mathematics, we need a method for finding them. We seek nonzero vectors V 
and real numbers c such thau 



AV = cV. (1) 
If I is the identity matrix of order 2, then (1) can be written as 

AV w (cI)V, 



or 



If we let 



(A - cI)V = 0 . 



^1 ^12 
^21 ^22 



equation (2) becomes 



and V = 



(2) 



EKLC 
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'11 
hi 



c a 



32 



(3) 



We 



know that there is a nonzero vector V satisfying an equation of the form 



BV = 0 



if and only if 



5(B) = 0. 



Hence equation (2) has a solution other than the zero vector if and only if c 
is chosen in such a way as to satisfy the equation 



(^U - "^^^22 - - V2I = °- 



Rearranged y this equation becomes 



c - ia^^ + a^^)c + &(A) = 0, 



(4) 



which is called the characteristic equation of the matrix A. Once this 
quadratic equation is solved for c, the corresponding vectors V satisfying 
equation (1) can readily be found, as illustrated in the following example. 

Example . Determine the fixed lines under the mapping 



2 3 
0 1 



We must solve the matrix equation 



2 - c 3 
0 1 - c 



Since 



5(A - cl)' = (2 - c)(l - c), 
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the characteristic equation is 

(2 c)(l - c) = 0, 

or 

c^ - 3c + 2 = 0, 

the roots of which are 1 and 2. For c = 1, equation (3) becomes 

[l 3 
0 0 

which is equivalent to the system 

X + 3y = 0, 
I Ox + Oy « 0 . 

Thus, A maps the line x + 3y = 0 onto itself; that is, the set F 















-3 


: r e R 




r 








1, 



is mapped onto itself. Actually, since c = 1, each vector of this subspi 
is invariant: if V is a characteristic vector, f(V) its image, and c 
then 

f(V) = V. 

For c = 2, equation (3) becomes 

'o 3 
0 -1 

or 

3y » 0, 
-ly « 0 . 





Hence, 
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A » 



2 3 
0 1 



maps the line y « 0 onto itself; the set of v^^^tora F 



r e R 



is closed under the transformation* 

The characteristic equation associated with the matrix 



A « 



2 3 
0 1 



is 



c - 3c + 2 « 0 . 



This equation expresses a real-^umber function. For a matrix function, the 
corresponding equation is 

- 3C + 21 « 0, 

where I is the identity matrix of order 2 and 0 is the zero matrix of 
order 2. If we substitute A in this matrix equation, 



2 3 
0 1 



- 3 



2 3 
0 1 



+ 2 



1 0 
0 1 



0. 



we find that A is a root of its characteristic equation. This is truic; for 
any 2x2 matrix. 

Theorem W. The matrix A 



A « 



^1 hi 

^21 ^22 



is a solution of its characteristic equation 



A - {SL^^ + a22)A + 5(A)I « 0. 
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The proof is left an an exercise. 

Theorem 5—7 is the case n =« 2 of a famous theorem called the Cayley- 
Hamilton Theorem , which states that an analogous result holds for matrices of 
any order n. 



Exercises 5—5 

1. Determine the characteristic roots and vectors of each of the following 
matrices: 



(a) 
(b) 



2 5 
0 3 



-3 4 



(c) 



2 1 

-1 0 

0 2 

0 1 



2. Prove that zero is a characteristic root of a matrix A if and only if 
6(A) - 0. 

3. Show that a linear transformation f is one-to-one if and only if zero Is 
not a characteristic root of the matrix representing f. 

4. Determine the invariant subspaces (fixed lines) of the mapping given by 



6 2 
2 3 



Show that these lines are mutually perpendicular 



5. Show that the matrix A = 
(matrix) equation 



^11 ^12 
^21 ^22 



is a solution of its characteristic 



6. Show that 



A ia^^ + a22)A + 6(A)I = 0. 



is an invariant vector of the transformation 



but that 2 



V — ^ I IVI I V, 
is not invariant under this mapping. 
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7. 



Show that A maps every line through the origin onto itself if and only if 



r 0 
0 r 



for r ^ 0, 



8, Let d = (aj^j^ - a^^l^ + 4 ^i2^2V ^H' ^12' ^21' ^22 

real numbers. Show that the number of distinct real characteristic roots 
of the matrix 



^11 ^12 
^21 ^22 



is 

0 if d < 0, 

1 if d = 0, 

2 if d > 0. 



9. Find a nonzero matrix that leaves no line through the origin fixed. 

10. Determine a one— to-one linear transformation that maps exactly one line 
through the origin onto itself. 



r s 
s t 



has two distinct characteristic 



11. J Show that every matrix of the form 

roots if s 0 . 

12. Show that tiie matrix A and its transpose A^ have the same characteristic 
roots • 



5—6. Rotations and Reflections 

Since length is an important property in Euclidean geometry, we shall look 

for the linear transformations of the plane that leave unchanged the length 

IIVII of every vector V. Examples of such transformations are the following: 

(a) the reflectiur of the plane in the x axis, 

(b) a rotation of the plane through any given angle about thc^ origin, 

(c) a reflection In the x axis followed by a rotation about the origii 
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Actually^ ve can show that any linear traTT^fcannation that preserves the lengths 
of alii vectors is equivalent to one of tb=se^ree. The following theorem will 
be very us; Snl in proving that result. 

Tcr^^^n 5-6 . A linear trans forma ticn of H that leaves unchangnL the 
length -^v^iry vector nl^o leaves unchfficsd the inner p. ^^ii>-.r: i>- s^ery pair 

of rect^^fe"^ ^"^A (b) thrr magnitude of the ^rnii^i i^: ' *%^een every paiiT or :^*"Ctors . 

^ .. U>t V anc U be a pair c" s in H and let V and U* 

be chii^c: r*: .^ctive images under the cranf ' :ion • In virtue of Hxercise 

I IV + Ul 1^ = I IV! r .V • U + IIUII^ (1) 

and 

I IV' + U' 11^ « IIVVM + 2V' © U' + MUN r . (2) 
Since the transformation is linear, for the image of V + U we have 

(V + U)' « V + u* . 
Consequently, (2) can be written as 

I l(V + U)' 11^ = IIV* 11^ + 2V' • U' + I IV' I 1^. (3) 
But the trans fonn8.tion preserves the length of each vector; thus, we obtain 

IIV'II = IIVII, llU'll = MUM, and ll(V+U)Ml = llV+Uil. 
Making these substitutions in equation (3), we get 

IIV + Ul 1^ « I IVI 1^ + 2V' • U' + IIUI 1^. (4) 
Comparing equations (1) and (4), you see that we must have 

V • U = V • U* ; 

that is, the transformation preserves the value of the inner product. 
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Since the magnitude of the angle v,<itwe- , V c U can be expressed in 
terms of inner products (Theorem ^5), it i .7^'»£ t±ax the transformation also 
preserves that magnitude. 

Corollary 5-8-1 . If a linear trans .-m« ^-^ por-^rves the length of every 
vector, then it maps orthogonal vectors ia^?:; vectors. 

By the definition of orthogonality, th^i impl / ^i^ans that the geometric 
vectors are mutually perpendicular. 

It is very easy to show the trans formai::: i i considering also pre- 

serve the distance between every pair of pori> i the plane. We state this 
property formally in the next theorem, the - - > v vh:ch is left as an exercise. 



Theorem 5—9. A linear transformation 



orce^er-ies the length of every 
i^v-r ,r^air of points in the plane; 



vector leaves unchanged the distance betweer 
that is, if V and U' are the respective >Hi*get? the vectors V and U, 
then 



Let U3 now find a matrix representing any ^zix^^n linear length-preserving 
transformation of H. All we need to find are the Images of the vectors 



and 



^2 = 



under such a transformation. (Why is this so?) 

If SJ^ and are the respective images 

that both SJ^ and S.^ are of length 1 and t 
other. 



and S. .. then we know 
:rtr/Hy are ort&ogonal to each 



Suppose that S[ forms the angle a (alpaa) with the positive half of 
the X axis (Figure 5-12). Since the length of S[ equals 1, we have 



cos Ot 
sin a 



We know that S' is perpendicular to S[. Hlsnce, there are two opposite 



223 

[sec. 5-fct] 




Figure 5-12. A length-preserving transformation, 

possibilities for the direction of S', because the angle p (beta) that 
makes with the positive half of the axis may be either 

s 

p - a + 1 

or 

P = a - ^ . 

In the first case (5), we have 





COS :p 




COS 








^in 


a 




sin iS 


D 


















sin 








cos 


a 



In the second case (6), we have 





COS 






Sin 


a 


s 






m 








sin 


r 




-cos 


a 










> 



224 



a6 

Accordingly, any linear trsr.s formation f that leaves the leng-Ji q£ each 
vector unchanged must be -apTSsented by a matrix having t. sr the fcrm 



cos a — sin o: 
sir a COS a 



(7) 



or the form 



IS a sin a 
L-a O: —cos a 



(8) 



In the first instance (7), the nrans formation f simply rotsntes the basis 
vectors and through -nn angle a and we suspect that f is a rotation 

of the entire plane H through thai: angle. To verify this observation-, we write 
the vector V in terms of its angle of inclination 0 (theta) to the y. axis 
and the length r = IIVII; that is, we write 



cos G 
sin G 



(9) 



Forming AV from equations (7^ and (9), we obtain 



AV = 



r( rr=; G cos a — sin G sin a) 
r(sii: G cos a + cos G sin a) 



From the formulas x± trigonometry, 



cos (G + a) = cos G cos a - sin G sin a, 
sin (9 + a) sin 0 cos a + cos G sin a, 



we see that 



AV 



r cos (G + a) i 
.r sin (G + a) ] 



Thus, AV is the vector of length r at an angle G + a to the horizontal 
axis. We have proved that the ^trix A repissjrrs a -^.tation of H through 
the angle a. 

But suppose f is represented by the matrix B in equation (8) above. 
This transformation differs froE ti^ one represented by A in thaz-ithe vector: 
S' is reflected across the line of ^e vector CossKeqnently , 3°^- may 

suspect that this trans formatirGsr amounts to a reflection (of the pILsnfi. in the 

[sec. 2-6] 
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It axis foLiov«r.tl by a rot:-: ::, on thrraagh th^i angle a. Sine- you; know that the 
reflection in the x axi: is represented by the matrix 



J ^ 



I: 



0 — L 



you may thererare expec : that 



(10) 



We leave this verification zs an 



EiiaxclssFS 5—6 



1. Obtain the matrices that rotate H cirrough the foLLowing-smgles: 

(a) 180°, (f) 90°, 

(b) 45°. (g) -120°, 

(c) 30°. (h,^ 360°, 

(d) 60°, (1) -135°. 
. (e) 270°, ( j) 150°. 

2. Write out the matrices tha.t "Tspreseirtr the r—rms formation: consisting of a 
reflection in the x axis Allowed 'by the stations c£. Exercise 1. 

3. Verify Equation (10), abo^. 

i 

4. A Linear transfomnation of ^ that preseirwra. the iiCin^r^ii of every vectox is 
called an orthogonal cxatiEFtoraation, and ttematixiir^ reroasenting the* trans- 
formation is called an .artagjSDn^ matrix^ Provs that :=ne transpose of an 
orthogpnal matrix d& tlcthosiSEL^ 

5. Show feat the invsrse of n^' t ni i^g nngi - m ^t - - r v ±3 anr ort luman a L matrix • 

6. Show that the prcnruct of tawr^ jixaa gonal narnrLces t« orthaganal.. 

7. (a) Siiow that a tn:anslattoTr:;i± R in tte 5±rs=rlan'of vector 



and through a distance equal to the length of U is givem by the mapping 

[sac. 5-6] 
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V — ^ V - u. 



(b) Show that rhis mapping does not preserve the lengrh of every vector, 
but that it doe^ preserve the distance between every parr of points in the 
p lane . 

(c) Determine whether or not this mapping is linear. 

8. Let R and R denote rotations of H through ::he angles CC and p, 
respectively. Prove that a rotation through a -^ollcuved ny a rotation 
through P amounts to a rotation through a + thnc is, show thst 



10. 



Note that the matrix A of iik}iiation (7) is a represemtation of a complex 
number. What does the result of Exercise 8 imply for z=mpXex numbexBl 

(a) Find a matrix that represents a reflection acxoss itixe line of tiia 
vector 



cos a 

sin a 



(b) Show that the matrix B of equation (8), aiirove, represents a re- 
flection across the line of some vector. 
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Appendix 
RESEARCH EXERCISES 

~ ~ The exercises in this Appendix are essentially '*researtSi---type'' problems 
designed to exhibit aspects of theory and practice in matrix algebra that '^-rnr 'l d 
not be included in the text. They are especially suited as individual ass:Egn— 
ments for those students who are prospective majors in the theoretical and 
practical aspects of the scientific disciplines, and for scmidents who wouic iike 
to test their mathematical powers. Alternatively, small groups of students 
might join forces in working them. 

1 . Quaternions . The algebraic system that is explorsri in this exercise 
was invented by the Irish mathematician and physicist, WfXlxfisa Rowan Hamilton, 
who published his first paper on the subject in 1835. It; was not until 1858 
that Arthur Cayley, an English mathematician and lawyer, puhiiished -the first 
research paper on matrices, though the name matrix had pres^/lously been appiiesi 
^by James Joseph Sylvester in 1850 to rectangular arrays of numbers. Sisrce 
Hamilton's system of quaternions is actually an algebra of matric^., it is mnce: 
easily presented in this guise than in the form in which It:: va* Siirst c ™n»»i-!r^ ^? 

In the present exercissa, we shall consider: the algebra 1 2 :iaatr±ce« 
with complex numbers as entries . The definitions of addltior*,. mr^l :r i p 1 -^^T-fltf mrr ^ 
and inversion remain the same. We use C for:- the set of all mi^T^^ px -n nml m 
and we denote by K the set of all matrices 



■where z, w, z^^ and Wj^ are elements of C. As is the case with matricra. 
having real entries, tiie element 



of K has an inverse if and only if 



zWj^ - wZj^ f 0, 
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and then we have 



z w 
^1 ^ 



-L 



Since L is a complex number, the unit matrix is still 



1 0 
0 1 



If 



z = X + iy, 



then we write 



z = X iy 



and call this liumber tne complex conjugate of z, or simply the conjugate of z 
A quaternion is /an element q of K of the particular form 



z w 
-w z 



y zee and w € C. 



We denote by Q the set of all quaternions. 

(a) Show that 6(q) « x^ + y^ + u^ + if z « x + iy and w « u + iv. 
Hence conclude that, since x, y, u, and v are real numbers, 6(q) « 0 if 
and only if q 2* 

(b) Show that if q e Q, then q has an inverse if and only if q !* 0 , 
and exhibit the form of q ^ if it exists. 

Four elements of Q are of particular importance and we give them special 



names: 



1 0 
0 1 



w « 



r 

i 

[l ol 

[o -ij ' 

0 ll 

-1 oj • 

■o ll 

1 oJ • 
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(c) Show that if 



z w 



where z « x + iy and w - u + Iv, then 

q « xl + yU + uV + vW. 

(d) Prove the following identities involving I, U, V and W: 

UV«W«-VU, VW=U«-WV, and WU«V«-UW. 

(e) Use the preceding two exercises to show that if q € q and r 6 Q, 
then q + r, q - r, and qr are all elements of Q. 

The conjugate of the element 



and 



z w 



where z = x + iy, w = u + iv, 



is 



2 -W 

w z 



and the norm and trace are given respectively by 
and 



t(q) » 2x. 



(f) Show that if q e Q, and if q is invertible, then 



-1 1 - 
q = J ^ 

Iql 
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From this conclude that if q € Q, and if q"^ exists, then q ^ € Q, 

(g) Show that each q e Q satisfies the quadratic equation 

2 2 
q - t(q) q + Iql I « 0, 

(h) Show that If q € q, then 

qq = Iql I. 

Note that this may be proved by using the result that if 

q « al + bU + cV + dW, 

then 

q o al - bU - cV - dW, 

and then using the results given in (d) • 

(1) Show that if q € Q and r € Q, then 

Iqrl « Iql Irl 

and 

Iq + rl < Iql + Irl . 

The geometry of quaternions constitutes a very interesting subject. It 
requires the representation of a quaternion 

q = al + bi; + cV + dW 

as a point with coordinates (a, b, c, d) in four-^dimensional spaces. The 
subset of elements, 

Q- - {q: q € Q and Iql « 1], 



is a group and is represented geometrically as the hyper sphere with equation 
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a + b + c 



+ d » L, 



2* Nonassociattve Algehvras 

The algebra of matrices we restrict ot=r attention ±n this exercise to the 
set M of 2x2 matrices) has an associative but naf xl commutative multipli- 
cation. "Algebras" with nonassociative mul^iplicatiois: :hsive become increasingly 
important in recent years — fcr example, in mathematical genetics. Genetics is 
a subdiscipline of biology and is concerned, wife txansmisBion of hereditary— 
traits. Nonassociative "al^bras" are impor.tant.ialso in the study of quantum 
mechanics, a subdiscipline of physics. We rgi-ra axst a.simple example of a Lie 
algebra (named after the Norwegian geometer Sajmiis Lie). 

If A C M and B € M, ^fie write 

AoB « AB - BA 

and read this "A op B, " "op" being an g'^fiT.-r.(±d/.ir a ri . rnn forr operation . 

(a) ' Prove the following properties xsf err 

(i) AoB " - BoA, 
(ii) AoA » 0, 
(iii) Ao(BoC) + Bo(CoA) + Co(Ai2B) = 0, 
(iv) Aol = 0 « loA.. 

(b) Give an example to show that ^(BaC) and (AoB)oC are different 
and hence that o is :not an associative xroexatton. 

Despite these stzange properties., o bBha^s nicely relative to ordinary 
matrix addition. 

(c) Show that o distributes ovex addisinnn 

Ao(B + C) « (AcSi) -f (AoC) 

and 

CA + B)oC « (AaC) -f CBoC). 

(d) Show that o b^oEsres nicely reletSwe to multiplication by a number. 
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It will be recalled that A ^ is termed the multiplicative inverse of A 
and is defined as the element B satisfying the relationships 

AB = I « BA. 

But it njust also be recalled that this definition was motivated by the fact that 

AI » A « lA, 

that is, by the fact that I is a multiplicative unit, 
(e) Show that there is no o unit. 

We know, from the foregoing work, that o is neither commutative nor 
associative. Here is another kind of * operation, called Jordan multiplication ; 
If A € M and 0 € M, we define 

(AB + BA) 
AjB « 2 * 

We see at once that 

AjB » BjA, 

so that Jordan multiplication is a commutative operation; but it is not 
associative. 

(C) Determine all of the properties of the operation j that you can. 
For example, does j distribute over addition? 

3. The Algebra of Subsets 

We have seen that there are interesting algebraically defined subsets of 
M, the set of all 2x2 matrices. One such subset, for example, is the set 

which is isomorphic with the set of complex numbers. Much of higher 
mathematics is concerned with the "global structure'' of "algebras," and generally 
this involves the consideration of subsets of the "algebras" being studied. In 
this exercise, we shall generally underscore letters to denote subsets of M. 
If A and B are subsets of M, then 

A + B 
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y Is the set of all elements of the form 

A + B, where A € A and B e B. 
; In 8et--bullder notation this may be written 

A + B » {A + B : A € A and B 6 B ) . 
By an additive subse t of M Is meant a subset A C M such that 

A + A C A. 

(a) Determine which of the following are additive subsets of M: 

(i) (0), 
(11) {I), 
(111) M, 
(Iv) Z, 

(v) M^, the set of all A In M with 5(A) = 1, 
(vl) the set of all elements of M whose entries are nonnegatlve. 

(b) Prove that If A, B, and C are subsets of M, then 

(1) A + B « B + A, 
(11) A + (B + C) « (A + B) + C, 
(ill) and If A C B then A + C C B + C. 

(c) Prove that If A and B are additive subsets of M, then 

A + B 

is also an additive subset of M. 

Let V denote the set of all column vectors 

X 

with X € R and y € R. 

(d) Show that If v is a fixed element of V, then 
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* _ _ ' 

' A: A e M and Av « q ► 

V, 

is an additive subset of M. Notice also that if Av = 0 then (~-A)v = 0. 
If A and B are subsets of M, then 

AB 

is the set of all 

AB, A e A and B € B. 
Using set-builder notation, we can write this in the form 

AB = (AB: A e A and B e B} . 
A subset A of M is multiplicative if 

AA C A. 

(e) Which of the subsets in part (a) are multiplicative? 

(f) Show that if A, B, and C are subsets of M, then 

(i) A(BC) ^ (M)£» 
(ii) and if A C B, then AC C BC 

(g) Give an example of two subsets A and B of M such that 

AB ^ BA. 

(h) Determine which of the following subsets are multiplicative: 

(i) (0, 1), 
(ii) (I, 

(iii) the set of all elements of M with negative entries, 
(iv) the set of all elements of M for which the upper left-hand 
entry is less than 1, 
(v) the set of all elements of M of the form 
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X y 
0 ij ' 

with 0 < X, 0 < y, and x + y < 1. 

The exercises stated above are suggestions as to how this "algebra of 
subsets" works. There are many other results that come to mind, but we shall 
leave them to you to find. Here are some clues: How would you define tA If 
t € R and A C M? Is (-l)A = - A? Walt a minute I What does -A mean? 
What does A'^ mean? Does set multiplication distribute over addition, over 
union, over intersection? Do not expect that even your teacher knows the 
answer to all of these possible questions. Few people know all of them and 
fewer still, of those who know them, remember them. If you conjecture that 
something is true but the proof of it escapes you, then try to construct an 
exaniple to show that it is false. If this does not work, try proving it again, 
and so on. • 

4. Analysis and Synthesis of Proofs 

This is an exercise in analysis and synthesis, taking an old proof to 
pieces and using the pattern to make a new proof'. In describing his activities, 
a mathematician is likely to put at the very top that of creating new results. 
But "result" in mathematics usually means "theorem and proof." The mathematician 
does not by any means limit his methods in conjecturing a new theorem: He 
guesses, uses analogies, draws diagrams and figures, sets up physical models, 
experiements, computes j no holds are barred. Once he has his conjecture firmly 
in mind, he is only half through, for he still must construct a proof . One way 
of doing this is to analyze proofs of known theorems that are somewhat like the 
theorem he is trying to prove and then synthesize a proof of the new theorem. 
Here we ask you to apply this process of analysis and synthesis of proofs to 
the algebra of matrices. To accomplish this, we shall introduce some new 
operations among matrices by analogy with the old operations. 

For simplicity of computation, we shall use only 2x2 matrices. 

To start with, we introduce new operations in the set of real numbers , R. 
If X € R and y e R, we define 

X A y the smaller of x and y (read; "x ca£ y") 

and 
t 

X V y *■ the larger of x and y (read: "x cup y") . 
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(a) Show that if x e R, y € R, and z e R, then 
(i) X A y = y A X, 
(ii) X V y « y V X, 
(iii) X A (y A z) « (x A y) A z, 
(iv) X V (y V z) « (x V y) V z, 

(v) X A X = X, 

(vi) X V X = X, 

(vii) X A (y V z) » (x A y) V (x A z), 
(viii) X V (y A z) « (x V y) A (x V z) . 

Although the foregoing operations may seem a Little unusual, you v?ill have 
no difficulty in proving the above statements. They are not difficult to • 
remember if you notice the following facts: 

The even-numbered results can be obtained from the odd'-numbered results by 
interchanging A and V , and conversely. 

The first states that A is commutative and the third states that A is 
associative. The fifth is new but the seventh states that A distributes over 
V , 

To define the matrix operations, let us think of A as the analog of 
multiplication and V as the analog of addition and let us begin with our new 
matrix "multiplication,'^ 

We define 

(a A x) V (b A z) (a A y) V (b A w)' 
(c A x) V (d A a) (c A y) V (d A w)J * 

This is simply the row-by— column operations, except that A is used in 
place of multiplication and V is used in place of addition. To see this more 
clearly, we write 

ax + bz ay + bw 
cx + dz cy + dw 

(b) Write out a proof that if A, B, and C are elements of M, then 

A(BC) « (AB)C, 



a b 

c d 



I] 



a b 

c d 



X y 
z w 
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:Be sure not to omit any steps in the proof. Using this as a pattern, writ;^ out 
a proof that 

A A (B A C) « (A A B) A C, 

verifying at each step that you have the necessary results from (a) to make the 
proof sound. List all the properties of the two pairs of operations that you 
needy such as associativity, commutativity , and distributivity . 

(c) Using the analogy between V and addition, define A V B for elements 
A and B of M. 

(d) State and prove, for the new operations, analogs of all the rules 
you know jlor the operations of matrix addition and multiplication. 
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Abel Ian group, 90 
Addition of matrices, 9 

associative law for, 12 
, commutative law for, 12 
Addition of vectors, 147 
Additive inverse of a matrix, 14 
Additive subset, 225 
Algebra, 100, 102 

global structure of, 224 

nonassociative, 223 
Analysis of proofs, 227 
Analysis, vector, 175 
Angle between vectors, 15^ 
Anticommutative matrix, 49 
Area of a piirallelogram, 161 
Arrow, 136 

head, 13t 

tail, 136 
Associative law, for addition, 12 
Associative law, 

for multiplication, 43-46 
Basis, 171 

natural, 171 
Cancellation law, 37 
Cap, 227 

Cayley-Hamilton Theorem, 210-211 
Characteristic equation, 208 
Characteristic root, 206 
Characteristic value, 205—207 
Characteristic vector, 205-207 
Circle, unit, 88 
Closure, 53 

Collinear vectors, 155 
Column matrix, 4 
Column of a matrix, 2 
Column vector, 4,- 133 
Combination, linear, 170 
Commutative group, 90 
CommutativtJ law for addition, 12 
Complex coujugate, 220 
Complex number, 1, 94, 219 
Components of a vector, 137 
Composition of trans formations , 198- 
Compression , 185 

Conformable matrices, for addition, 

for multiplication, 27 
Conjugate quaternion, 221 
Contraction factor, 180 
Cosine, direction, 138 
Cosines, law of, 153 
Counting number, 1 
Cup, 227 



Decimal, infinite, 1 
Dependence, linear, 172 
Determinant function, 77 
Diagonalization method, 131 
Difference of matrices, 14 / 
Direction cosines, 138 
Direction of a vector, 138 
Displacement, 184 
Distributive law, 41-45 
Domain of a function, 177 
Dot product of vectors, 154 
Eigenvalue, 206 
Electronic brain, 2, 132 
Elementary matrices, 124 
Elementary row opesration, 114, 124 
Embedding of an aigebra, 100 
End point of vector, 137 
Entry of a matriy-^. 3 
Equality of matrices, 7 
Equation, characteristic, 208 
Equivalence, row, 114 

Equivalent systens; of linear equations, 
105 

Equivalent vectors, 138 
Expansion factor, 179 
Factor, contraction, 180 

expansion, 179 
Field, 55 
Fixed point, 206 
Four— dimensional space, 222 
Free vector, 175 
Function, 177 

determinant, 77 

domain, 177 

matrix, 109 

range, 177 

real, 177 

stretching, 183 

vector, 177 
Galois, Evariste, 92 

Geometric representation of vector, 140 
Global structure of algebras, 224 
•199 Group, 85, 90 
abelian, 90 
10 commutative, 90 

of invertible matrices, 85 
related to face of clock, 90 
Head of arrow, 136 
Uypersphere, 222 

Identity matrix, for addition, 11 

for multiplication, 46 
Image, 178 
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Independence, linear, 172 
Infinite decimal, 1 
Initial point or vector, 137 
Inner product of vectors, 152, 154 
Integer, 1 

Invariant sub space, 211 

Invariant vector, 209 

Inverse of a matrix, 62—63, 113 

of order tV70, 75 , 
Inverse of a number, 54 

of a transformation, 204 
Isomorphism; 94, 100 
Jordan multiplication, 224 
Kernel, 196 
Law of cosines, 153 
Left multiplication, 37 
Length of a vector, 137—138 
Linear combination, 170 
Linear dependence, 172 
Linear equations, system of, 103, 119 

equivalent, 105 

solution of, 103-105 

relation to matrices, 107 

solution by diagonalization method, 
131 

solution by triangularization method, 
152 

Linear independence, 172 

Linear map, 190, 196 

Linear transformation, 190, 196 

Located vector, 136—137 

Map, 178 

inverse , 204 
kernel, 196 
linear, 190, 196 

one— to-one, 201 
onto, 178 
Matrices, 1, 3 
Matrix, 1, 3 
addition, 9 

associative law for, 12 
commutative law for, 12 
conformalbility for, 10 
identity element for, 11 
additive inverse, 14 
anticommutative , 49 
column, 4 
column of, 2 

conformable for addition, 10 
difference, 14 
division, 50-51 
elementary, 124 
entry of, 3 
equality, 7 

identity for addition, 11 
for multiplication, 46 



Matrix (continued), 
inverse, 62—63, 113 

of order two, ,75 
invertible, 63 . 
multiplication, 24, 30, 32 
cancellation law for, 37 
conformability for, 27 
as^t, 37 
right, 37 
' multiplication by a number, 19—20 
negative of, 14 
order of , 3 
orthogonal, 217 
product, 26 
row, 4 . 
row of, 2 
square, 4 

order of , 4 
sum, 10 

transformation, 189 

transpose of, 5 

unit, 46 

variable, 109 

zero, 11 
Matrix j&unction, 109 
Multiplication, 24, 30, 32 

Jordan, 224 
Multiplication of matrices, 24, 30, 32 

distributive law for, over addition, 
41-45 

Multiplication of matrix by number, 
19-20 

Multiplication of vector by number, 
144 

Natural basis, 171 
Negative of a matrix, 14 
Ncnassociative algebra, 223 
Norm of a quaternion, 221 
Norm of a vector, 141 
Null vector, 139 
Number, 1 

Number, complex, 1, 94 
conjugate, 220 
counting, 1 
integer, 1 
inverse, 54 
rational, 1 
real, 1 

One— to— one transformation, 186 
Op e ra t ion , row , 1 14 , 1 24 
Opposite vectors, 138 
Order of a matrix, 3, 4 
Orthogonal matrix, 217 
Orthogonal projection, 186 
.Orthogonal transformation, 217 
Orthogonal vectors, 156 
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Parallel vectors, 143, 163 
Parallelogram rule, 149 
Perpendicular projection, 186 
Perpendicular vectors, 153 
Pivot, 132 
Point, fixed, 206 
Product of transformations, 198 
Projection, 

orthogonal, 186 

perr. ndicular, 186 
Ouatemion, 219—223 

conjugate, 221 

geometry of, 222 

norm, 221 

trace, 221 
Range of a function, 177 
Rational number, 1 
Real function, 177 
Real number, 1 
Reflection, 178, 212 
Representation of vector, 140 
Right multiplication, 37 
Ring, 57-58 

with identity element, 60 
Rise, 137 

Root, characteristic, 206 

Row equivalent, 114 

Row matrix, 4 

Row of a matrix, 2 

Row operation, 114, 124 

Row vector, 4 

Rotation, 198, 212 

Run, 137 

Scalar, 195 

Set, 53 

closure under an operation, 53 

element of, 57 
Shear, 183 
Sigma notation, 30 
Slope of a vector, 138 
Space, 166 

four-dimensional , 222 
Square matrix, 4 
Square root of unit matrix, 39 
Standard representation, 140 
Stretching function, 183 
Subset, 

additive, 225 

algebra of, 224 
Subspace, 166—167 

invariant, 211 
Sum cf matrices, 10 
Synthesis of proofs, 227 
System of linear equations, 103, 119 

solution by diagonalization method, 
131 



System of linear equations (continued) 
solution by triangularirrfrrLon method 
132 

Hail of vector, 136 
Terminal point of vector, 137 
Trace of a quaternion, 221 
Transformation, 

composition, 198 

geometric, 177—178 

inverse , 2U4 

kernel, 196 

1 eng th-p res e rv ing , 2 14— 2 17 

linear, 190, 196 
one— to— one, 201 

matrix, 189 

one— to~one, 186 

orthogonal, 217 

plane, 177-178 

product, 198 
Translation, 184 
Transpose of a matrix, 5 
Triangularization method, 132 
Unit circle, 88 
Unit matrix, 46 
Value , 

characteristic , 205—207 
Variable, 

matrix, 109 
Vector, 4, 133 

addition, 

parallelogram rule for, 149 

analysis, 175 

angle, 153 

basis, 171 

characteristic , 205—207 
col linear, 155. 177 
column, 4, 133 

order of, 133-134 
component, 137 
direction, 138 
dot product, 154 
end point, 137 
equivalent, 138 
free, 175 
function, 177 

geometric representation, 136 

initial point, 137 

inner product, 152, 154 

invariant, 209 

length, 137-138 

linear combination, 170 

located, 136-137 } 

multiplication by a number, 144 

natural basis, 171 

norm, 141 

null, 139 
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Vector (continued), 
opposite, 138 
orthogonal, 156 
parallel, 143, 163 
perpendicular, ^153 

representation by located vector, 140 
rise, 137 
row, 4, 136 



Vector (continued), 

run, 137 

slope, 138 

space, 166 

subspace, 166—167 

teminal point, 137 

variable, 177 
Zero matrix, 11 



